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Q\ • Abstract 

We present a new algorithm for exactly solving decision making prob- 
lems represented as influence diagrams. We do not require the usual 
. assumptions of no forgetting and regularity; this allows us to solve prob- 

^ ' lems with simultaneous decisions and limited information. The algorithm 

^ _ is empirically shown to outperform a state-of-the-art algorithm on ran- 



domly generated problems of up to 150 variables and 10 64 solutions. We 
show that the problem is NP-hard even if the underlying graph structure 
of the problem has small treewidth and the variables take on a bounded 



(N 
> 

■ number of states, but that a fully polynomial time approximation scheme 



exists for these cases. Moreover, we show that the bound on the number 



, of states is a necessary condition for any efficient approximation scheme. 

7— I ' 

^ ! 1 Introduction 

Influence diagrams [l2| are graphical models for utility-based decision making 
under uncertainty. Traditionally, they are designed to represent and solve situa- 
tions involving a single, non-forgetful decision maker. Limited memory influence 
■ diagrams (hereafter LIMIDs) are generalizations of influence diagrams that al- 

*~i , low for multi-agent and limited information decision problems [la. 123 |0 More 

precisely, LIMIDs relax the regularity and no forgetting assumptions of influence 
diagrams, namely, that there is a complete temporal ordering over the decision 
variables, and that at any step any previously disclosed information is remem- 
bered and taken into account to make a decision. These assumptions fail when 
decisions can be made simultaneously (e.g., by non-interacting agents), or when 
we wish to limit the decision history for computational reasons (e.g., to avoid 
an exponential blow up in the size of policies). 



*A short version of this paper has been accepted for presentation at NIPS 2011. 

1 Historically, Howard and Matheson |12H defined influence diagrams as graphical represen- 
tations of general decision scenarios, and referred to the special cases respecting regularity and 
no forgetting as decision networks. The latter was then used by Zhang et al. [23| to describe 
the general case. Here, we adopt the more recent terminology of Lauritzen and Nilsson [16(1 . 
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Solving a (limited memory) influence diagram refers to finding a combina- 
tion of local decision rules (called a strategy) that maximizes expected utility. 
This task has been empirically and theoretically shown to be very hard [Bj]. In 
fact, we show here that solving a LIMID is NP-complete if we admit only singly 
connected diagrams with number of states per variable no greater than three. 
In addition, we show that an algorithm that produces provably good approx- 
imations within any fixed factor is unlike to exist even for diagrams with low 
treewidth, but that a fully polynomial time approximation scheme exists if we 
further restrict the variables to take on a bounded number of states. 

Under certain graph-structural conditions (which no forgetting and regular- 
ity imply), Lauritzen and Nilsson [l6| show that LIMIDs can be solved by a 
dynamic programming procedure with complexity exponential in the treewidth. 
However, when these conditions are not met, their iterative algorithm is only 
guaranteed to converge to a local optimum. Recently, de Campos and Ji [5j 
formulated the CR algorithm that maps a LIMID into a credal network Q and 
then solves the corresponding marginal inference problem by mixed integer lin- 
ear programming. By that, they were able to solve small problems exactly and 
obtain good approximations for medium-sized problems. 

In this paper, we show that (partial) combinations of local decision rules can 
be partially ordered according to the utility they induce, and that dominance of 
a partial combination implies dominance of all (full) combinations that extend 
it. This greatly reduces the search space of strategies. Using these results we 
develop a generalized variable elimination procedure that computes the optimal 
solution by propagating only non-dominated (partial) solutions. We show ex- 
perimentally that the algorithm can enormously save computational resources, 
and compute exact solutions for medium-sized problems. In fact, the algorithm 
is orders of magnitude faster than the CR algorithm on randomly generated 
diagrams containing up to 150 variables and 10 64 strategies. 

The paper is organized as follows. Section [5] formally describes LIMIDs and 
presents new results about the complexity of solving a LIMID. In Section [31 
we present a new algorithm for computing exact global solutions and discuss 
its complexity. Section [4] contains the modifications necessary to convert the 
algorithm into a fully polynomial time approximation scheme for diagrams of 
bounded treewidth and number of states per variable. The performance of 
the algorithms is evaluated in Section [5] Finally, Sections [5] and [7] contain 
related work and final discussion. To improve readability, some of the proofs 
and supporting results appear in the appendix. 

2 Limited Memory Influence Diagrams 

In the formalism of (limited memory) influence diagrams, the quantities and 
events of interest are represented by three distinct types of variables or nodesH 
Chance variables represent events on which the decision maker has no control, 
such as outcomes of tests or consequences of actions. Decision variables repre- 
sent the options a decision maker might have. Finally, value variables represent 
additive parcels of the utility associated to a state of the world. The set of 
all variables considered relevant for a problem is denoted by U. Each variable 

2 We make no distinction between a node in the graphical representation of a decision 
problem and its corresponding variable. 
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X in U has an associated domain fix, which is the finite non-empty set of 
values X can take on. The elements of tlx are called states. We assume the 
existence of the empty domain f2g = {A}, which contains a single element A 
which is not in any other domain. Decision and chance variables are assumed 
to have domains different from the empty domain, whereas value variables are 
always associated to the empty domain. The domain VL X of a set of variables 
x = {Xi, . . . , X n } C U is given by the Cartesian product Vlx x X • ■ • X 0,x n of 
the variable domains. Thus, an element u G flu defines a state of the world, 
that is, a realization of all actions and events of interest. If x and y are sets of 
variables such that y C x C U, and x is an element of the domain il x , we write 
x^ v to denote the projection of x onto the smaller domain f2 y , that is, x*" E Vt y 
contains only the components of x that are compatible with the variables in 
y. By convention, x^ = A. The cylindrical extension of y € fi y to Q x is the 
set y^ x = {x £ £l x : x^ v = y}. Often, we write X\ ■ ■ ■ X n to denote the set 
{Xi, . . . , X n } and, if clear from the context, X to denote the singleton {X}. 

Some operations over real- valued functions need to be defined. Let / and g be 
functions over domains Q x and Q y , respectively. The product fg is defined as the 
function over domain $l x \j y such that (fg){w) — f{w^ x )g{w^ y ) for any w of its 
domain. Sum of functions is defined analogously: (f+g)(w) = f(w^ x )+g(w^ y ). 
Notice that product and sum of functions are associative and commutative, 
and that product distributes over sum, that is, fg — gf, f + g = g + /, and 
f{g+h) — fg+fh. If / is a function over £l x , and y CM, the sum-marginal ^2 f 
returns a function over £l x \ y such that for any element w of its domain we have 
(Ej, f)M = E, e „t. f(x). Notice that if y n x = 0, then £„/ = /■ Also, the 
sum-marginal operation inherits commutativity and associativity from addition 
of real numbers, and hence Y^ xUy f = E a \ v E„ / = J2 y \ x J2 X /■ If {f3}}ven v 
is a set containing functions f% with domain f2 x , one for each element of Q y , 
we write f y to denote the function that for all w 6 fl x uy satisfies f x (w) = 
(w^ x ). For instance, if X and Y arc two binary- valued variables, and 
f V x = (fx {*i)>fxfa)) = (V 2 > V 2 ) and /| 2 = (0, 1) two functions over {X}, 
then the function % = {ff {xi),f$ (x 2 ), ff ( Xl ), (x 2 )) = (1/2,1/2,0,1). 
If clear from the context, we write 1 to denote a function that returns one to 
all values in its domain and to denote a function that returns always zero. 
More general, we write k to denote a function that returns always a constant 
real value k. Hence, if / and g are functions over a domain fl x and A; is a 
real number, the expressions / > g and f — k denote that f(x) > g(x) and 
f(x) = k, respectively, for all x G fl x . Finally, any function over a domain 
containing a single element is identified with the real number it returns. 

Let C denote the set of chance variables in U, V the set of decision variables, 
and V the set of value variables. The sets C, T> and V form a partition of U. 
A LIMID C consists of a direct acyclic graph (DAG) over the set of variables 
IA annotated with variable types (decision, chance and value), together with a 
collection of (conditional) probability mass functions (one for each chance value) 
and utility functions (one for each value variable) |^| The value nodes in the graph 
have no children. The precise meanings of the arcs in C vary according to the 
type of node to which they point. Arcs entering chance and value nodes denote 
stochastic and functional dependency, respectively; arcs entering decision nodes 

3 When no confusion arises, we write C to refer both to the LIMID and to its directed 
graph. 
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Figure 1: LIMID of the problem in the Example [T] 



describe information awareness or relevance at the time the decision is made. 
If X is a node in C, we denote by pa^ the set of parents of X, that is, the set 
of nodes of C from which there is an arc pointing to X. Similarly, we let chx 
denote the set of children of X (i.e., nodes to which there is an arc from X), 
and fax = pa x U {X} denote its family. The descendants of X are all nodes to 
which there is a directed path from X in C. Each chance variable C in C has 
an associated set {pj : 7r £ ^ P a c } of (conditional) probability mass functions 

quantifying the decision maker's beliefs about states x £ fic conditional on 
a state n of its parents (if C has no parents, it has a single probability mass 
function assigned). We assume any chance variable X G C to be stochastically 
independent from its non-descendant non-parents given its parents. Each value 
variable V £ V is associated with a real- valued utility function uy over f2 P a v , 
which quantifies the (additive) contribution of the states of its parents to the 
overall utility. Thus, the overall utility of a state x £ ficu'D is given by the sum 
of utility functions J2v&v uy{x^ v ). 

For any decision variable D £ T>, a policy So specifies an action for each 
possible state configuration of its parents, that is, Sr> ■ ^ P a D — > &d- If D has 
no parents, then 8d is a function from the empty domain to Qd, and therefore 
constitutes a choice of x £ Hp. The set of all policies 8d for a variable D is 
denoted by An- 

The following artificial examples help to illustrate the use of LIMIDs for 
modeling decision-making problems under uncertainty involving multiple deci- 
sion makers with limited information. 

Example 1. Consider a game where each of the n participants has to decide 
between adding a ball to an urn or removing a ball from it, without knowing 
neither the state of the urn nor the other participants ' decisions. If a participant 
decides for removal when the urn is empty then two balls are put. If the urn 
already contains two balls, a decision of adding a ball is ignored. The goal 
is to finish the seguence of n decisions with no balls in the urn. To avoid 
intercommunication, the participants are kept in separate rooms, and they are 
asked for a decision in some pre- determined ordering unknown to them. Figure]]] 
depicts the graph structure of a LIMID modeling the problem. As usual, we 
graphically represent decision, chance, and value variables by sguares, ovals and 
diamonds, respectively. The nodes D\, . . . , D n represent the decisions available 
to each of the participants in the given ordering. Each chance node Xi represents 
the number of balls in the urn after i decisions have been made (0,1, or 2), and 
is therefore associated to a deterministic function (Xq models the initial state 
of the urn). The value node R has an associated utility function that returns 
1 if the state of X n is the element corresponding to an empty urn, and zero 
otherwise. For any Di, a policy <$D; corresponds to a fixed decision of adding or 
removing a ball for the ith participant. 
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Figure 2: LIMID of the problem in the Example [5] 



Example 2. Consider a slightly modified version of the game in Example [7J 
where each participant is informed of the previous decision. Also, the initial state 
of the urn is disclosed to the first participant (so the first participant is aware 
he/she is the first to make a decision, but all the other remain ignorant about 
their positions in the sequence). Figure^ depicts the graph structure of a LIMID 
modeling the problem. Notice the extra arcs from Di to \, i = 1, . . . , n — 1, 
representing that the decision made by the (i — l)-th participant is known to the 
i-th participant, and the arc Xq — > D\ indicating the known state of the urn to 
the first participant to make a decision. The policy Sd 1 prescribes a decision (to 
add or remove a ball) for the first participant for each possible initial state of 
the urn (empty, one ball or two balls). Similarly, for i = 2, . . . ,n, a policy bo i 
specifies whether the i-th participant should decide to add a ball depending on the 
(i — l)-th participant's decision. For example, if 5d 2 = (£d 2 ( a dd), Sd 2 (remove)) 
denotes the policy of the second participant ( which is a function of the first par- 
ticipant's decision), then policy 5d 2 — (remove, add) prescribes that the second 
participant should remove a ball when the first participant decides for addition, 
and otherwise add a ball. The four possible policies for the second participant 
are Ac 2 = {(add, remove), (add, add), (remove, add), (remove, remove)}. 

Let A = X DeT> Arj denote the space of possible combination of policies. 
An element s = (5r>)r> e x> £ A is said to be a strategy for C. Given a policy 
<5d and a state tt G ^pa^, let p£> denote a probability mass function for D 
conditional on pa D = tt such that pY>( x ) = 1 if as = <$d(7t) and pj)(x) = 
otherwise. Hence, there is a one-to-one correspondence between functions p^ D 
and policies Sd G Ad, and specifying a policy 3d is equivalent to specifying 
p P p D . We denote the set of all functions p p ^ D obtained in this way by Vd ■ A 
strategy s induces a joint probability mass function over the variables in C U T> 

by 

p* = n pc° n p»° . « 

cec DGV 
and has an associated expected utility given by 

E S [C}± Ps(x)J2M^ v ) (2) 

= J2psJ2 uv - ( 3 ) 

cuv vev 

Notice that the two sums in Eq. Q have different semantics. The outer (left- 
most) sum denotes the sum-marginal of the set of variables C WD, whereas the 
inner (rightmost) denotes the overall utility function over ljy e y pa y that results 
from the sum of functions uy . 
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The treewidth of a graph measures its resemblance to a tree and is given by 
the number of vertices in the largest clique of the corresponding triangulated 
moral graph minus one Given a LIMID C of treewidth u), we can evaluate 
the expected utility of any strategy s in time and space at most exponential in 
Li. Hence, if to is bounded by a constant, obtaining E s [£] takes polynomial time: 

Proposition 3. Given a LIMID C with bounded treewidth and a strategy s, 
E s [C] can be computed in polynomial time. 

Proof. Given a strategy, the LIMID can be mapped into a LIMID with no 
decision nodes in polynomial time by replacing each decision node D by a chance 
node with associated probability table p v ^ D . The bounded treewidth implies 
that the number of parents of any decision node is bounded, and then encoding 
p V p D takes time polynomial in the number of elements in f (an input of the 
LIMID). Then the LVE algorithm in Section [3] can be employed, which runs in 
polynomial time since there are no decisions in the input (therefore all sets are 
singletons) and the treewidth is bounded^ □ 

The primary task of a LIMID is to find a strategy s* with maximal expected 
utility, that is, to find s* € A such that 

E S [£]<E S «[£] for alls. (4) 

The value E s * [C] is called the maximum expected utility of C and it is denoted by 
MEU[£]. For most real problems, enumerating all the strategies is prohibitively 
costly. In fact, computing the MEU in bounded treewidth diagrams is NP- 
complete and, as the following result implies, it remains NP-complete in 
even simpler LIMIDs. 

Theorem 4. Given a singly connected LIMID with treewidth equal to 2, and 
with variables having at most three states, deciding whether there is a strategy 
with expected utility greater than a given k is NP-complete. 

The proof, based on a reduction from the partition problem, is given in the 
appendix. Notice that the simple LIMIDs of Examples Q] and [2] (depicted in 
Figures [1] and [21 respectively) meet the preconditions of Theorem 21 and are in 
general most unlikely to be efficiently solvable for any sufficiently large n (in 
fact, the hardness is shown by a reduction of an NP-complete problem to the 
problem of deciding whether the MEU of the LIMID in Example [T] is greater 
than a given threshold). 

The complexity of solving a LIMID can be reduced by removing nodes and 
arcs that are irrelevant to the computation of the maximum expected utility. 
A chance or decision node is called barren if it has no children. Barren nodes 
have no influence on any value node and thus no impact on the MEU 8] . More 
irrelevances can be found by the concept of (i-separation [l9| , which we succinctly 
state in the following paragraph. 

A trail in £ is a sequence of nodes such that any two consecutive nodes are 
connected by an arc. Notice that a trail does not need to "follow" the direction 

4 See also Koller and Friedman [15| for a slightly simpler variable elimination algorithm 
that computes fixed-strategy solutions of bounded treewidth diagrams in polynomial time. 

5 A directed graph is singly connected if the underlying (undirected) graph contains no 
cycles. 
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Figure 3: LIMID of the problem in the Example [5] 
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Figure 4: LIMID of the problem in the Example [5] after the removal of nonreq- 
uisite arcs. 



of the arcs. A trail X, Z, Y is said to be active with respect to a set of variables 
w either if X and Y are both parents of Z and Z or any of its descendants are 
in w, or if at least one of X and Y is not a parent of Z and Z is not in w. A 
trail is blocked by a set of variables w if it contains a triple of consecutive nodes 
which is not active with respect to w. Two sets of nodes x and y are d-separated 
by a set of nodes w if all trails from a node X in x to a node Y in y are blocked 
by w. Intuitively, if X and Y are d-separated by w, then X and Y are irrelevant 
to each other once we know the state of the variables in w. 

A parent node A of a decision node D is nonrequisite (to D) if it is d- 
separated from all the value nodes that are descendant of D given all the re- 
maining parents and D. The arc from X to D is then said to be a nonrequisite 
arc. Likewise barren nodes, nonrequisite arcs can be removed without affecting 
the MEU @, [T|3] , and by doing so a node may become barren. We say that 
a LIMID is minimal if it contains no nonrequisite arcs and no barren nodes. 
Given a LIMID we can obtain its corresponding minimal diagram in polynomial 
time by repeatedly removing nonrequisite arcs and barren nodes 3- For 
the rest of this paper, we assume LIMIDs to be minimal. 

The following example illustrates the concepts of d-separation and nonreq- 
uisite arcs. 

Example 5. Consider a modified version of the multiplayer game in Example^ 
where in addition to the previous participant 's decision also the current state of 
the urn is disclosed to a participant before he/she makes a decision. The graph 
structure of a LIMID representing this problem is depicted in Figure [3J Notice 
the arcs Xi-% — > Di, for i = 1, . . . , n— 1, representing the known state of the urn 
to each participant. Consider the arc D\ — > D 2 . All trails connecting D\ and R 
are blocked by fa£> 2 \{Z?i} = {D 2 ,Xi}. Therefore D\ is a nonrequisite node for 
D 2 and the arc D\ — > D 2 can be removed without altering the MEU. In fact, all 
arcs Di-x — > Di, for i = 1, . . . ,n, are nonrequisite and can be safely removed. 
On the other hand, Xq is a requisite node for D\ because, for instance, the trail 
Xo, X\,..., X n , R is active given the set f&D 1 \ {Xq} = {Di}. Intuitively, the 
information provided by the arcs £>i_i — > Di (i.e., the previous participant's 
decision) does not help in making a decision once the current state Aj_i of the 
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urn is known. Figure^ depicts a minimal version of the diagram in Figured 



Under the usual assumptions of complexity theory, when a problem is NP- 
hard to solve the best available options are (i) trying to devise an algorithm that 
runs efficiently on many instances but has exponential worst-case complexity, or 
(ii) trying to develop an approximation algorithm that for all instances provides 
in polynomial time a solution that is provably within a certain range of the 
optimal solution. In Section [31 we take option (i), and present an algorithm that 
efficiently computes optimal solutions for many LIMIDs, but runs in exponential 
time for many others. Given e > 0, an e-approximation algorithm (for solving a 
LIMID) obtains a strategy s such that (1 + e) E s [£] > MEU[£]. As the following 
result indicates, alternative (ii) is most likely unfeasible, even if we consider only 
diagrams of bounded treewidth. 

Theorem 6. Given a singly connected LIMID £ with bounded treewidth, (unless 
P=NP) there is no polynomial time e-approximation algorithm, for any < e < 
2 —1, where 8 is the number of numerical parameters (probabilities and utilities) 
required to specify £. 

We defer the proof to the appendix. However, we show in Section 0] that, 
differently from the general case, there are LIMIDs, namely, those with bounded 
treewidth and number of states per variable, for which a polynomial time e- 
approximation algorithm exists. 

Let £ be a LIMID and let k and K denote, respectively, the smallest and the 
greatest utilities associated to any of the value variables, that is, for all V G V 
we have that k < uy < K, and there are V and V such that uy{x) = k and 
uy(x') = K for some x G ^pa t , arid x' G ^ pav „- Assume k < K (otherwise 
the MEU is trivial), and let £' be the LIMID obtained from £ by setting each 
utility function uy associated to a value node V to u' v — (uy — k) / '{K — k). Note 
that by design each function u' v takes values on [0, 1]. The following well-known 
result allows us to focus on scaled utility functions. 

Proposition 7. For any strategy s, E s [£] = [K — k) E s [£'] + k\V\. 

Proof. The case of a single value node has been shown by Cooper and 
Shachter and Peot (2l| . The extension to multiple value nodes is straightfor- 
ward. For any strategy s we have that 



E.[£'] = £ < 



•v 



cuv vev 




cuv 




which, since p s is a probability distribution on C U £>, equals 



1 



(E s [£]-k\V\) . 



K-k 



Hence, the result follows. 



□ 
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In the rest of the paper we consider only LIMIDs with utilities taking values 
in some subset of the real interval [0, 1], which due to Proposition [7] does not 
incur any loss of generality. 

3 Solving LIMIDs Exactly 

The basic ingredients of our algorithmic framework for representing and han- 
dling information in LIMIDs are the so called valuations, which encode informa- 
tion (probabilities, utilities and policies) about the elements of a domain. Each 
valuation is associated to a subset of the variables in U, called its scope. More 
concretely, a valuation with scope a; is a pair (p, u) of bounded nonnegative 
real- valued functions p and u over the domain Q x ; we refer to p and u as the 
probability and utility part, respectively, of 0. Often, we write <f> x to make 
explicit the scope x of a valuation 0. For any x C U, we denoted the set of 
all possible valuations with scope x by $ x . The set of all possible valuations is 
thus given by $ = [j x cu^ x - The set $ is closed under two basic operations 
of combination and marginalization. Combination represents the aggregation of 
information and is defined as follows. 

Definition 8. If = (p,u) and ijj — (q,v) are valuations with scopes x and y, 
respectively, its combination <j>®ip is the valuation (pq,pv + qu) with scope xUy. 

Marginalization, on the other hand, acts by coarsening information: 

Definition 9. 1/0 = (p, u) is a valuation with scope x, andy is a set of variables 
such that y C x, the marginal <fi^ v is the valuation Q2x\y Pi Ex\ y u ) scope 
y. In this case, we say that z = x\y has been eliminated from 0, which we 
denote by <f>~ z . 

Notice that our definitions of combination and marginalization differ from 
previous works on LIMIDs (e.g., [16|). which usually require a division of the 
utility part by the probability part. The removal of division turns out to be an 
important feature when we discuss maximality of valuations later on. 

In terms of computational complexity, combining two valuations and tp 
with scopes x and y, respectively, requires 3|Q x uj/| multiplications and fizuyl 
additions of numbers; computing <p^ v , where y C x, costs If^uyl operations of 
addition. In other words, the cost of combining or marginalizing a valuation is 
exponential in the cardinality of its scope (and linear in the cardinality of its 
domain). Hence, we wish to work with valuations whose scope is as small as 
possible. The following result shows that our framework respects the necessary 
conditions for computing efficiently with valuations (in the sense of keeping the 
scope of valuations obtained from combinations and marginalizations of other 
valuations minimal). 

Proposition 10. The system ($, U, (g>, j) satisfies the following three axioms of 
a (weak) labeled valuation algebra JlA. \2Bl. 

(Al) Combination is commutative and associative, that is, for any (f>±, <f>2, (f>3 S 
$ we have that 

4>1 ® 02 = 02 ® 4>1 , 
01 ® (02 ® 03) = (01 ® 02) ® 03 • 
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(A2) Marginalization is transitive, that is, for <j) z G <& z and y C x C z we have 
that 

= 4& . 

(A3) Marginalization distributes over combination, that is, for <j) x £ $ x , 4> y £ 
$ y and iCzCiUi/ we have that 

Proof. (AH]) follows directly from commutativity, associativity and distributivity 
of product and sum of real-valued functions, and (A|2j) follows directly from 
commutativity of the sum-marginal operation. To show consider any two 

valuations (p, u) and (q, v) with scopes x and y, respectively, and a set z such 
that iCzCiUi/. By definition of <g) and .J., we have that 

[(p,u)®(q,v)] lz = I Pq, (P V + 1 U ) 

\xUy\z xUy\z 

Since xUy\z — y\z, and p and u are functions over Q x , it follows that 



Y pq > Y (p v + Q u )\ = \p^2<1,P^2 v + u ^2q 

xUy\z xUy\z I \ y\z y\z y\z 



= (p,u)<E> ( Y <?: v 
K y\z y\z 



which equals (p, y) ® {q, v) 



iynz 



□ 



The following is a direct consequence of (A[3]) that is required to prove the 
correctness of the variable elimination procedure. 

Lemma 11. If <fi x £ &x, 4*y & $y, z <Z y and z fl x = 0, then (cf) x <g> (f> y )~ z = 

<i>x®<l>- z - 

Proof. Let w — x U y\ z. Since x fl z = 0, it follows that x C w C x U y. Hence, 
by definition of elimination and (A[3]), we have that 

{(/) x <E><j>y)- Z = {(j> X ® <j)y) lW 
= fa ® ■ 



But y D w = y \ z. Thus, 



□ 



The following result shows how valuations can be used to compute expected 
utilities for a given strategy. 



Proposition 12. Given a LIMID £ and a strategy s = (5d)dgv £ A, let 



A 

s = 


(g)(/c c ,0) 


® 


® (pS d .o) 


<8> 






_cec 




.-Dec 




_vev 



(5) 



where, for each D, p^ D is the function in Vd associated with policy 5d- Then 
<j)f equals (1,E S [£]). 
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Proof. Let p and u denote the probability and utility part, respectively, of <j>\® . 
By definition of combination, we have that <p s = (p s ,p s J2vev u v), where p s = 
YlxeCuuPx* as m ©• Since p s is a probability distribution over C U V, it 
follows that p = Y, x &-i CUT ,Ps{ x ) = 1- Finally, u = J2cuvPsJ2vev u V' which 
equals E 8 [C] by ©. □ 

Hence, given a strategy we can use a variable elimination procedural to 
compute its expected utility in time polynomial in the largest domain of a vari- 
able but exponential in the width of the elimination ordering [e.g.. Il5l Section 
23.4.3] Q However, computing the MEU in this way is unfeasible for realistic di- 
agrams due to the large number of strategies that would need to be enumerated. 
For example, a simple LIMID consisting of a decision variable with four chance 
nodes as parents and one value node as child contains 10 3 = 10 81 strategies in 
A, if the decision variable has 10 states and each parent has 3 states. 

In order to avoid having to consider all possible strategies, we define a partial 
order (i.e., a reflexive, antisymmetric and transitive relation) over $ as follows. 

Definition 13. For any two valuations <f> = (p,u) and ip — (q,v) in $, we say 
that ip dominates <f> (conversely, we say that <f> is dominated by ij)), and we write 
4> < 4 1 ; if 4> an d V' have equal scope, p < q, and u < v. 

If 4> and "0 have scope x, deciding whether ip dominates <f> costs at most 
2|f2a;| operations of comparison of numbers. The following result shows that the 
algebra of valuations is monotonic with respect to dominance. 

Proposition 14. The system ($, U, ®, -l, <) satisfies the following two addi- 
tional axioms of an ordered valuation algebra ]1 A l. 

(AJf.) Combination is monotonic with respect to dominance, that is, 
if <t>x < ip x and <j> y < ip y then (4> x ® <f> y ) < (ip x ® ip y ) . 

(A5) Marginalization is monotonic with respect to dominance, that is, 

if 4>x < then ^ < ib^ . 

Proof. (A[4]). Consider two valuations (p x ,u x ) and {q x ,v x ) with scope x such 
that (p x ,u x ) < (q x ,v x ), and two valuations (p y ,u y ) and (q y ,v y ) with scope 
y satisfying (p y ,u y ) < (q y ,v y ). By definition of <, we have that p x < q x , 
u x < v x , p y < q y and u y < v y . Since all functions are nonnegative, it follows 
that p x p y < q x q y , p x u y < q x v y and p y u x < q y v x . Hence, (p x ,u x ) ® (p y ,u y ) = 

(PxPy,PxU y +p y U x ) < (q 

X Qy i Qx ~\~ Qy V x ) = (q x ,v x ) ($> (q y ,v y ). (^EJ). Let y be a 
subset of x. It follows from monotonicity of < with respect to addition of real 
numbers that 



{Px,u x ) iy = ( 22p x ,22u x ) < I }^q x ,yiv x \ = {q x ,v x ) iy . 

K x\y x\y 

Hence, the result follows. □ 




6 Variable elimination algorithms are also known in the literature as fusion algorithms [22| 
and bucket elimination [f|. 

7 The width of an elimination ordering is the treewidth of the tree decomposition it induces 
and can be computed in time polynomial in the number of variables. It also equals the 
maximum cardinality of the scope of a valuation in the variable elimination procedure minus 
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The algorithm we devise later on operates on sets of ordered valuations. 

Definition 15. Given a finite set of valuations \& C <£>, we say that 4> G \& is 
maximal if for all ip S l J / such that (f> < ip it holds that ip < <f>. The operator 
max returns the set max(^I') of maximal valuations of "P. 

If "J x is a set with m valuations with scope x, max^^) can be obtained by 
m 2 comparisons <fr <ip, where VO £ X ^x- 

We extend combination and marginalization to sets of valuations as follows. 

Definition 16. // ty x and ^ y are any two sets of valuations in $, 

® = {4>x ®(t> y :<t>x£ ®x,4>y G ^ y} 

denotes the set obtained from all combinations of a valuation in ty x and a val- 
uation in 

Definition 17. // ^ x C $ x is a set of valuations with scope x and i/Ci, 

' = Wx V :'V . >!' .! 

denote the set of valuations obtained by element-wise marginalization of valua- 
tions to y. 

It can be checked that sets of valuations with combination and marginal- 
ization defined element-wise satisfy axioms (A[JjH(A[3]), and therefore form a 
valuation algebra. Hence, Lemma [TT] applies also for sets of valuations with 
marginalization and combination defined as above. 

Lemma 18. // ty x C $ x and ^ y C $ y are two sets of valuations with scope x 
and y, respectively, and z is a set of variables such that z C y and z PI x = 0, 
then (t> x (g) * y )~ 2 = ^ x ® fy z . 

Proof. The result follows from element-wise application of Lemma [TTJ to ((f> x ® 

<f> v r z e (*i ® *„)-*. □ 

We are now ready to describe the LVE algorithm, which solves arbitrary 
LIMIDs exactly. Consider a LIMID £, and an ordering X\ < • • • < X„ over 
the variables in CUD. The algorithm is initialized by generating one set of 
valuations for each variable X in U as follows. 

Initialization: Let Vo be initially the empty set. 

1. For each chance variable X £ C, add a singleton ^x — {(Px* >0)} to Vo 

2. For each decision variable X £T>, add a set of valuations ^ x — {(Px* > 0) : 

eP x }toV . 

3. For each value variable le V, add a singleton ^x = {(L^x)} to Vo- 

Once Vo has been initialized with a set of valuations for each variable in the 
diagram, we recursively eliminate a variable Xi in C U T> in the given ordering 
and remove any non-maximal valuation: 

Propagation: For i = 1, . . . , n do: 
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1. Let Bi = 0. Remove from Vi_i all sets whose valuations contain Xj in 
their scope and add them to Bi. 

2. Compute * 4 = max([(g)^ eBi *]~ Xi ). 

3. Set V 4 4 V,_! U 

Termination: Finally, the algorithm outputs the utility part of the single 
maximal valuation in the set (^)^ eV ^, that is, the algorithm returns the real 
number u such that (p, u) E max(0^ £V '&). u is a real number because the 
valuations in &)^ eV *B have empty scope and thus both their probability and 
utility parts can be identified with real numbers. 



The elimination ordering X\ < • • • < X n can be determined using the stan- 
dard heuristics for variable elimination in Bayesian networks such as minimizing 
the number of fill-ins or the cardinality of the domain of the neighbor set [pjlflij ]. 

Differently from other message-passing algorithms that obtain approximate 
solutions to LIMIDs by (repeatedly) propagating a single valuation (e.g., the 
SPU algorithm |l6|), the LVE algorithm computes exact solutions by propagat- 
ing many maximal valuations that correspond to partial combinations of local 
decision rules. The efficiency of the algorithm in handling the propagation of 
several valuations derives from the early removal of valuations performed by the 
max operation in the propagation step. 

Consider the set — {4>s '■ s £ A}, where each <j> s is given by (0). It is not 
difficult to see that 



®{&E°.o)} 

cec 

*xGV 



{(p^,0):p^£V D } 



Dev 



{(W)} 



vev 



Hence, by PropositionQ2]we have that each in "I^ is a valuation with prob- 
ability part one and utility part equal to the expected utility of some strategy in 
A. Since the relation < induces a strict (linear) order over theMEU of the 
diagram equals the utility part of the (single) valuation in max^j^). The vari- 
able elimination procedure in the propagation step is responsible for obtaining 
max((^) $eV ty) = max(5 , £ 15 ) more efficiently by distributing max and \, over 
®*xev wn i cn allows for a significant reduction in the cardinalities of sets 
and scopes of valuations produced. The following result states the correctness 
of the algorithm. 

Theorem 19. Given a LIMID C, LVE outputs MEU[£]. 

The proof, which requires some technicalities, is in the appendix. 



3.1 Complexity Analysis 

Assume that decision nodes are parentless. We will show in Section 13.41 later 
on that we can transform any given LIMID into an equivalent model in which 
decision nodes have no parents. Parentless decisions allows us to avoid having 
to deal with sets whose cardinality is exponential in the number of parents. 
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The time complexity of the algorithm is given by the cost of creating the sets 
of valuations in the initialization step plus the overall cost of the combination 
and marginalization operations performed during the propagation step. Re- 
garding the initialization step, the loops for chance and value variables generate 
singletons, and thus take time linear in the input. Since decision nodes have no 
parents, there are po — \£Id\ policies in Ajj (which coincides with the number 
of functions in Vd) for each decision variable D. There is one valuation in the 
corresponding set r> added to Vo for every policy in Aj> Let p = maxogu Pd 
be the cardinality of the largest policy set. Then the initialization loop for de- 
cision variables takes 0(\T>\p) time, which is polynomial in the input. Let us 
now analyze the propagation step. As with any variable elimination procedure, 
the running time of propagating (sets of) valuations is exponential in the width 
of the elimination ordering, which is in the best case given by the treewidth of 
the diagram. Consider the case of an elimination ordering with bounded width 
u), and a diagram with bounded number of states per variable n. Then the 
cost of each combination or marginalization is bounded by a constant, and the 
complexity depends only on the number of operations performed. Moreover, we 
have in this case that p < k. Let v denote the cardinality of the largest set 

for i = 1, . . . , n. Thus, computing 4^ requires at most i/M - 1 operations of 
combination (because that is the maximum number of sets that we might need 
to combine to compute (^>^, g g. ^ in the propagation step) and v operations of 
marginalization. In the worst case, v is equal to p' 15 ' < 0(k' t> ^), that is, all sets 
associated to decision variables have been combined without discarding any val- 
uation. Hence, the worst-case complexity of the propagation step is exponential 
in the number of decision variables, even if the width of the elimination ordering 
and the number of states per variable are bounded. Note however that this is a 
very pessimistic scenario and, on average, the removal of non-maximal elements 
greatly reduces the complexity, as the experiments in Section [5] show. 

3.2 Strategy Selection 

Most likely, one is not only interested in the maximum expected utility of a 
LIMID but also in an optimum course of action for every possible scenario, that 
is, in an optimal strategy that obtains the MEU. LVE can be easily modified 
to provide an optimal strategy by storing at each step the policies associated 
to non-dominated valuations as follows. For each valuation created in the ini- 
tialization step associate a list which is empty unless the valuation refers to a 
policy of a decision variable, in which case the list contains the associated policy. 
Now, any valuation <f>i 6 ^ factorizes as (£> ■ ■ ■ <8> '4 ) \B i \)~ Xi , where each ipj 
is an element of a different set $ in Bi. For i = 1, each ipj is associated to a 
list. Assign a list to each cf>i which equals the concatenation of the lists of its 
factors ipj. Thus, the list contains a choice of policies for all decision variables 
D such that X\ £ f a£>. For each i, associate a list to each fa G vffj which equals 
the concatenation of lists of its factors. An optimal strategy s* is thus easily 
obtained from the list associated to <j) n € max(0^ eVi The handling of lists 
can be implemented by simple pointers to valuations in sets in Vo, and therefore 
the asymptotic complexity of the algorithm is unaltered. 
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3.3 Reverse Topological Ordering 

The valuations used by LVE specify twice as many numbers as the cardinality 
of the domain of their associated scope. It is possible to decrease by a factor of 
two the number of numerical parameters per valuation the algorithm needs to 
handle by constraining the elimination of variables to follow a reverse topological 
ordering in the diagram, that is, by requiring each variable to be processed only 
after all its descendants have been processed. As the following result shows, any 
reverse topological ordering produces valuations whose probability part equals 
one in all coordinates. 

Proposition 20. If X\ < ■ ■ ■ < X n denotes a reverse topological ordering over 
the variables inCWD, then for i = 1, . . . , n the valuations in ^i have probability 
part p=l, where 1 is the function that always returns the unity. 

Proof. We show the result by induction on i. Regarding the basis, we have from 
the reverse topological ordering that X\ is a variable containing only value nodes 
as children. Hence, B\ = U {{(l,tiy)} : V € ch^}, where by definition 

equals {{px* 1 >0)} ^ -^i i s a chance node, and {(p x Xl ,0) : Px* 1 G ^Xi} 
if it is a decision node. It follows that 



*i = max (E Pxx 1 > 2 Pxi 1 2 uv 

Since for any ir £ ^pa Xi > Px x i s a probability mass function over X\ , we have 
that p = Px* 1 = 1- Assume by inductive hypothesis that the result holds 
forl,...,i-l,andlettf I = ® VeBAVo tf. Then * 4 = max([0^ eB . nVo tf]®**). 
By inductive hypothesis all valuations in a set in Bi \ Vo have probability part 
p — 1. Hence, by definition of combination, the valuations in \I> X contain also 
probability part equal to one. The reverse topological ordering implies that 
by the time variable Xi is processed in the propagation step, all its children 
have been processed. Hence, the only element of Bi R Vo is the set \&x 4! which 
equals {(p x Xi ,0)} if Xi is a chance node, {(p x Xi ,0) ■ Px* 1 7^;} if 
is a decision node, and {(l,uxi)} if it is a value node. Thus, we have that 
if>i = max(*Xi ® *x)- The case when Xi is a value node is immediate, since any 
valuation in \I>j is the result of a combination of two valuations with probability 
part equal to one. If Xi is not a value node then 

= max ^{ (j^Px^ : (Px** > °) e , (1, u x ) £ tf*}^ 

= max ^{ :(Px?S0)e*x i ,(l,«x)e* x |^ , 

since p x . is a probability mass function for any 7r £ ^ P a x • □ 

The result states that if we assume a reverse topological elimination ordering, 
then LVE needs to care only about the utility part of the valuations. 
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Figure 5: A simple LIMID (on the left) and its transformed MEU-equivalent 
(on the right). 



3.4 Decision Nodes with Many Parents 

The computational cost of adding a set to Vo for a decision variable D is a 
serious issue to the algorithm when decision nodes have many parents. To see 
this, consider again the example of a LIMID with a single decision node D with 
four chance nodes as parents and one value node as child. If the decision variable 
has ten states, and each parent has three states, the set r> contains 10 3 = 10 81 
valuations! Fortunately, we can transform any diagram in an equivalent model 
in which decision nodes are parentless, and yet provides the same maximum 
expected utility. 

Transformation 21. Consider a LIMID C. For each decision node D in C with 
at least one parent, remove D and add m = |f2 P a D | chance nodes X\, . . . ,X m 
and m decision nodes D\, . . . , D m with domains flxi = = &d (for i = 
1, . . . ,m). Add an arc from every parent of D to each of X\, . . . , X m , an arc 
from every Xi to Xi + \, with i < to, and an arc from every Di to X$ , i = 
1, . . . , to. Finally, add an arc from X m to each child of D. Assume an ordering 
tti < • • • < 7r m of the states in 51 pa . For each node Xi, associate a function 

P a x 

p x . 1 such that for x G £!f ax > 

!1, if (a^ pa D ^ TTj and x^ x * = x 1 **- 1 ) 
or (x^o = 7Vi and x^ Xi = x^ Xi - % = x^ Di ) 
0, otherwise. 

Finally, the functions p^ x for each child X of D have D substituted by X m in 
their scope, without altering the numerical values. 

Figure [5] depicts a simple LIMID with three nodes (on the left) and the 
diagram obtained by applying Transformation EH (on the right). Two things 
are noteworthy. First, the treewidth of the transformed diagram (for any given 
LIMID) is increased by at most 2, because the subgraph containing the new 
nodes, the parents of D and the children of D is triangulated and contains cliques 
with at most \p& D U {Xi, Xi—i, Di}\ variables]! Second, the functions p\ Xi 
do not correspond to sets of probability mass functions because, for instance, 
Ex p\ Xi in) = for y 6 ^pa x such that — 71^ and y^ x *- 1 ^ yWi ; anc | 

8 Since the treewidth is given by the size of largest clique in the triangulated moral graph 
minus one, |pa D | is a lower bound on the treewidth of the original graph. 
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hence there is n £ ^ P a x for which J2x Px (v) ^ Therefore, strictly speak- 
ing, the transformation does not generate a new LIMID, but, as the following 
result shows, a model with equal MEU that fits into the framework of LVE. 

Proposition 22. Let £ be the result of Transformation \2l\ on a LIMID L. 
Then 

MEU[£'] = MEU[£] . 

The proof is in the appendix. For each decision variable D in the original 
LIMID, the transformed model contains m chance variables specifying m|f2£)| 3 
values, and m decision nodes with \Qd\ states. If the treewidth of the original 
diagram is bounded, then m is bounded by a constant and the transformation 
takes polynomial time. In terms of the running-time complexity of the LVE al- 
gorithm, the transformation substitutes a set ^ r> (generated in the initialization 
step), which would contain |f2.o| m valuations with scope fa£), with m sets $ ' £> i 
containing |Qd| valuations with scope D, and m singletons containing valuations 
with scope f &x { = ^-1, Di \ Upa^,. In the example of a ten-state decision 
variable with four ternary parents, the transformation generates 3 4 = 81 sets 
with 10 elements each, and 81 singletons, a reduction of more than 78 orders of 
magnitude in the space and time required to initialize the algorithm. 

If no valuation is ever discarded by the max operation in the propagation 
step (i.e., if all valuations generated are maximal) of LVE, after processing nodes 
Di,Xi, . . . , X m -\, D rn in the transformed diagram, a set containing \ ftrj \ m val- 
uations with scope {X n }L)pa D is created. This is the same number of valuations 
(with the same scope) that would contain the set in Vo in the initialization 
step if LVE was run with the original diagram. Thus, the transformed diagram 
has a worst-case time and space complexity similar to the original diagram 
(with some overhead due to the increase in the number of variables). However, 
as the removal of non-maximal elements reduces drastically the running time 
and memory usage, in practice, the transformation leads to an enormous saving 
of computational resources. 

4 An FPTAS For Solving Bounded LIMIDs 

According to Theorems 0] and solving a LIMID exactly is NP-hard even if the 
diagram has bounded treewidth and number of states per variable. In addition, 
obtaining an e-approximation is hard if the number of states per variable is not 
bounded. In this section, we show that for diagrams with bounded treewidth 
and number of states per variable it is possible to obtain a (multiplicative) 
fully polynomial time approximation scheme (FPTAS), that is, a family of e- 
approximation algorithms that runs in time polynomial in 1/e and in the input 
size. 

Definition 23. Given a relation R on VP C a set W is called an i? -covering 
(for ) if for every <f> £ \& there is tp £ \P' such that <f)Rip. 

For example, the set max(5') is a <-covering for ^. For any real number 
a > 1, we define a relation < a as follows. 

Definition 24. If <p — (p,u) and ip = (q,v) are two valuations in $, then 
<j) < a ip if <f> and ip have equal scope and p < aq and u < av. 
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Notice that when a = 1 the relation < a as defined above is equivalent to 
the partial order < in Definition 1131 If (p and ip have scope x, deciding whether 
<p < a ip costs at most 2 1 Q x | operations of comparison of numbers . Intuitively, the 
relation < a measures the maximum amount of information lost in representing 
(p, u) by (q, v), that is, (q, v) approximates (p, u) with a loss no greater than a in 
each of its coordinates. Notice that < a is neither transitive nor antisymmetric, 
and therefore not a partial order. If a > /3 > 1, then <p <p ip implies (p < a ip. 
In particular, we have that <p> < ip implies <p <a V 1 - Hence, any <-covering is 
also a < a -covering (but not the contrary). 

For any real number a > 1 we define an equivalence relation = a over valu- 
ations as follows. 

Definition 25. For any two real-valued functions f and g over domain ft x , 
f = a g if for all x S VL X either f(x) = g{x) or 

f{x) > 0,g(x) > and [\og a f(x)\ = [log a g(x)\ . 

If 4> — (P> u ) an d V* = v ) are an y t wo valuations, then <f> = a ip if <fi and ip 
have equal scope, and p = a q and u = a v. In this case, we say that cj) and are 
a-equivalent. 

If cp = a ip then (p < a ip and ip < a (p. Hence, for any two a-equivalent 
valuations (p and ip, approximating (p by ip incurs the same worst-case error as 
approximating ip by (p. Given any finite set of valuations ^, an <Q,-covering for 
^/ can be obtained by recursively discarding any of two a-equivalent valuations 
until no two a-equivalent valuations remain in the set. We denote by G a an 
operation that returns an < a -covering for ^ in this way: 

Definition 26. For any finite set of valuations ^ C the operation G Q returns 
a set G Q ,(^') C <f obtained by the following procedure: 

1. Set G a {^) initially to the empty set. 

2. Remove an element <p from ^ . 

3. If there is no ip 6 G ct (5') such that <p = a ip then add (p to G a (\E'). Else 
discard (p. 

4- If ^ is not empty, go back to step 2. 

Notice that step 2 in the definition of G Q does not specify how an element 
(p from "J should be selected. In our implementation, we randomly select a 
valuation to test. The following result justifies our interest in G a . 

Lemma 27. // ^ x is a finite set of valuations with scope x and probability and 
utility part not greater than one, then G a (^ x ) is an < a -covering for ty x with 
at most (1 — [log a tj) 2 '^ x ' elements, where t is the smallest (strictly) positive 
number in the probability or utility part of a valuation in ty x . 

Proof. To see that G a (^ x ) is an < a -covering for fy x , note that, by definition of 
G a , for any (p, u) £ ^ x there is (q,v) £ G a (^ x ) such that p = a q and u = a v. 
Hence, for all x 6 fl x , either p(x) = q(x) or there is a negative integer k such 
that a k < p(x) < a k+1 and a k < q(x) < a k+1 . From this, it follows that 
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p < aq. Analogously, we have that either u(x) = v(x) or u(x) < a k+1 < av(x). 
Hence, (p, u) < a (q,v). 

Let us now show that the upper bound on the cardinality of G ct (5 , a; ) holds. 
For any x £ Sl x , the a-equivalence relation partitions the two-dimensional space 
{(p(x),u(x)) : (p,u) £ ^x} in (1 — |l°g a t\) 2 subsets of a-equivalent valuations 
with respect to x (to see that, note that p(x) < t implies p(x) — 0, [log a p(x)\ is 
an integer for p(x) > t, and there are — Llog Q tj (distinct) integers between t and 
one. The same applies for u(x)). Hence, = a partitions ^ x in (1 — [log Q tj ) 2 ' nx ' 
subsets of a-equivalent valuations. Since G a discards any two a-equivalent 
valuations from ^> x , G a (^ x ) have at most (1 — [log^ ij ) 2 I £7 = elements. □ 

For any LIMID with bounded treewidth and number of states per variable, 
it is possible to obtain in polynomial time an elimination ordering such that 
the cardinality of the domain of any valuation obtained by variable elimination 
is bounded [lj, Thus, Lemma |2"T1 guarantees that G Q produces sets whose 
cardinality is polylogarithmic in the smallest positive value in the set (because 
the bounded treewidth and number of states imply \£l x \ is a constant). By 
definition of combination and marginalization, any value in the probability or 
utility part of a valuation obtained during variable elimination is a polynomial 
on the input numbers, and so G a (^i) returns an < Q -covering for whose car- 
dinality is polynomially bounded by the smallest positive value (probability or 
scaled utility) in the input. The polynomial running time of the approximation 
algorithm we devise here mainly derives from this result. 

For notational convenience, we define a new operation that combines set 
combination and G a as follows. 

Definition 28. If "$! x G $ x and ^ y 6 $j, are finite sets of valuations, we define 
their a -combination ^ x ® Q ty y for any a > 1 as G a (JS> x ® ^y)- 

The following example shows that © Q is not associative. Consider the fol- 
lowing three sets of valuations over the empty domain 

*! ={(1,0), (0.3,0)}, 
* 2 = {(0.5,0), (0.1,0)}, 
tf 3 = {(0-05,0), (0.4,0)}, 

and assume G a selects always the valuation with minimum probability part 
among a set of a-equivalent valuations (over the empty domain). If a = 10, it 
follows that 

(* i ©a $2) ©a * 3 = G a ({(0.5, 0), (0.1, 0), (0.15, 0), (0.03, 0)}) ffi Q {(0.05, 0), (0.4, 0)} 
= {(0.1, 0), (0.03, 0)} ® a {(0.05, 0), (0.4, 0)} 
= G Q ({(0.005, 0), (0.04, 0), (0.0015, 0), (0.012, 0)}) 
= {(0.04,0), (0.0015,0)}. 

On the other hand, we have that 

* 1 ©a (*2 ©a * 3 ) = {(1, 0), (0.3, 0)} © Q G Q ({(0.025, 0), (0.005, 0), (0.2, 0), (0.04, 0)}) 
= {(1, 0), (0.3, 0)} © Q {(0.025, 0), (0.005, 0), (0.2, 0)} 
= {(0.025, 0), (0.0075, 0), (0.2, 0)} . 
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Note that associativity fails also if we identify a-equivalent valuations: 



Llog Q ([*! ® Q * a ] ® Q * 3 )J = {(-2, 0), (-3, 0)} 

^{(-2,0), (-3,0), (-1,0)} 

= [\0g a ® a [* 2 ©a *3])J , 

where |_l°gaJ is applied element-wise. Therefore, (<i>i (B a Vf^) ©a ^3 7^ *i ©a 
(^2 ©0*3), but, as we show in the following lemma, they are both < a 2 -coverings 
for \&i <g> ^2 ® ^3- It is in this last feature of a-combination that we are mainly 
interested. 

Lemma 29. Let oi, . . . , a m denote nonnegative integers, and v&i, \t X) . . . , ty m , 
denote finite sets of valuations such that for i = 1, . . . , m, Nf^ is a < a a i -covering 
for ^i. Then 4^ © a • • • © a ty' m (where the operations are applied in any order) 
is a </3-covering for ^1 ® • • • ® \l/ m , where f3 — a m_1+ £i=i ai . 

Proof. We prove the result by induction on k. First notice that <f> < a ip is the 
same as <fi < («, 0)®^>. The basis (k = 1) follows immediately, as ^[ is an < a a i- 
covering for and j3 — a ai . Assume for 1 < k < m that (B a ■ • • ©a ^fe-i 
is a < 7 -covering for ^1 <g) • • • ® ^jt-i, where 7 = a* 1-2-1- ^^ 1 Qi . Since is a 
<a" fc -covering for it follows from (AH]) and the inductive hypothesis that 
for any 4> € *i © • • • © there is </>' G *i © a • • • ffi Q © such that 

< (t',0) © 0', where 7' = 7a° fc = a fe ~ 2+ £- fe =i <n. But, by Lemma [271 for 
any $ € #i ffi a • • • ©a © there is cj>" e #i © Q • • • ©a *' fe such that 

<f>' < (a, 0) <g> </>". Thus, for any there is </>" such that <f> < ( 7 ',0) © 0' < 
(7', 0) ® (a, 0) ® 0". By transitivity of <, it follows that < (/3, 0) ® </>", where 

For any given approximation factor e > 0, LVE can be turned into an FP- 
TAS by setting a = 1 + e/(2\U\), and replacing combination of sets with a- 
combination. The following algorithm, called the e-LVE algorithm, more 
formally describes the procedure. The only difference with respect to LVE is in 
the propagation step. 

Initialization: Let Vg be initially the empty set. 

1. For each chance variable X 6 C, add a singleton — {(Px X >0)} ^° ^0- 

2. For each decision variable X £ T>, add a set of valuations ^x — {(p V x x > 0) '■ 

P v x x eV x }toV Q . 

3. For each value variable X £ V, add a singleton ^ x = {(L u x)} to Vo- 
Propagation: For i = 1, . . . , n do: 

1. Let £> s - = 0. Remove from V' i _ 1 all sets whose valuations contain Xi in 
their scope and add them to B^. 

2. Compute = max([$i ffi Q • • • © Q <I>| B <|] _X '), where, for j = 1, . . . , 

3. Set V; 4 V?_! U 
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Termination: Return (p,u) € max((^)^ eV , 

The precise order in which the a-combinations are performed in the com- 
putation of a in e-LVE is irrelevant to the correctness of approximabil- 
ity results. Note, however, that different orders may lead to different so- 
lutions. For a given LIMID C and elimination ordering X\ < ■ ■ ■ < X n , 
let denote the set of valuations generated by LVE in the ith iteration of 
the propagation step, and ^ its corresponding set generated by e-LVE. Let 
si = \Bi \ — 1 = \B[\ — 1. For i = 2, . . . , n, we define a variable Si recursively as 
Si = \Bi\-l + E^-eB^Vo s i = ~~ 1 + 52*'-eB{\v^ s r Intuitively, s t denote 
the number of sets from Vo that are required either directly or indirectly to 
compute ^ (and also ^j) minus one. 

The following result is needed for the correctness of the approximation. 

Lemma 30. For i £ 1, . . . , n, is a <p-covering for "J^, where f3 — a Si . 

Proof. We prove the result by induction on i. Since B\ = B[, it follows from 
Lemma I2U1 with a\ = ■■■ = = that |(B^ e g' ^J^^ 1 is a ^-covering 

for [®^ e g 1 ^ 1 , where (3 — a Sl . Hence, for any <fi & ^i there is <f>' G 
[0*eB' ^}~ Xl such that (a Sl , 0) ® 0' > </>. But for any 4>' there is 0" € *i such 
that 4>" > ft. Hence, the basis follows from (a Sl ,0) ® 4>" > (a Sl ,0) ® <j>' > (p. 
Assume the result holds for j = 1, . . . , i — 1, and let m = = |£>^|. The set ^ 
equals max([$i © • • • © where each either equals some 'J'j, with 

j < i, or is in Vo- Likewise, ^ = max([$'j ffi a • • • © a $| B ,|] _Xi )j where each 
<£>' fc either equals some ^ (so that = tyj implies $' fe = ^ and vice-versa) 
or is in Vq = Vo- For k = 1, . . . , m, let a k = sj if $ fe = (and $' fe = ^ ) 
and Ofc = if <£>fc G Vo- By inductive hypothesis, each <£>' fc is a < Q afc -covering 
of $fc (if G Vq then $' fc equals and it is therefore a <-covering for 
Then, by Lemma [2^1 $' x ©«•••©« is a <^-covering for $i ® ■ • • ® $ m , where 
/? = a "*-i+Er=i °* = Also, by (A[5J| [$i© a - ■ ■©a^| B .|]~ ;SCi is a ^-covering 
for [$!©•• - Cg> <E> | | ] — . Finally, the result follows from transitivity of the partial 
order, that is, ^ is a f^-covering for <lv □ 

The a-combinations make sure that the cardinality of the sets remains 
bounded during the propagation. Hence, if the diagram has bounded treewidth 
and number of states per variable, the following result guarantees the correct- 
ness of the approximation and the polynomial running time in the input length 
and 1/e. 

Theorem 31. If C is a LIMID with bounded treewidth and number of states 
per variable, then e-LVE is an FPTAS for MEU[£]. 

Proof. First, we show that e-LVE indeed obtains an e-approximation to MEU[£]. 

Let T\ =B\. For i = 2, . . . , n, let Ti = (Bi n Vo) U U* eBi\v ^ denote the 
collection of sets $ e Vo that were used directly or indirectly in the computation 
of Then s± = \Fi\ — 1. Assume by induction that Sj = \J-j\ — 1 for 
j = — 1. Hence, s,; = — 1 + E*eBi\Vo — wn i cn equals 

\Bi\ - 1 - \B\ Vo| +E*, eB4 \Vo l-^il- But \ B *\ ~ & \ V °\ ec l uals 1^ n v o|- Thus, 
Si = \Bi Pi Vo| — l + E* eB«\ v o l^j'l' which equals |J-"i| — 1 because the sets in Vo 
are used exactly once (hence the sets Tj for different fyj G Bi \ Vo are disjoint, 
and also Bi D Vo). 
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Let m = \U\. The collections V n and V' n contain only sets ^ and 
respectively, generated during the propagation. Thus, J2^ ev Si = m — l^«l> 
because there are |W| sets in Vo and each set belongs to exactly one Ti for *j G 
V n . Like in the exact case (i.e., in LVE), the valuations in ^)^, eV , are pairs 
(1,E S [£]) for some strategy s. Hence, max(0 $ . eVn vE^) and max(0^, eV , '&■) 
return each a single valuation. Consider the valuation <\>* G max((^) t . 6V ^j). 
By definition, <p* factorizes as (j>* , where each cf>* belongs to exactly one set 

in V n . According to Lemma [5(11 for each tp* there is 4>i G such that 
0* < (a Si , 0) <g> Thus, it follows from (AgJ that 

0* = (g 0* < (g [(a Si ,0)®^-] . 
*,ev„ 

By associativity of <g>, we have that 

(g [(a s S0)®^] = (a^> 

= (a m " 

Lct 4> = ®^ e v' Hence, (jf < a m-iv„i </>, which implies (j>* < a m </> and there- 
fore a m u > u*, where u and u* denote the utility part of 4> and 4>* , respectively. 
If </> is not in max(0 3 ,. eV , then there is (f>' G max(0 lf . eV , \E^) such that 
4> < 4>' , and thus </>* < a ™ 0'. Thus, we can assume without loss of generality 
that (j) G max(0,j , eV , \E^), so that e-LVE outputs the utility part u of <j). Since 
a = 1 + e/(2m), it follows from Lemma [571 (in the appendix) that 

(l + e)u > (1 + e/2m) m u > u* . 

Let us now analyze the time complexity of the algorithm. Let u> — 1 denote 
the tree- width of the network and k the maximum number of states of a variable, 
both considered bounded. Hence, for any variable in U we have that |f a^l < uj 
and |f2faxl — The initialization step takes then 0(m) time: 

ol 2 l°f*xl ] <0(\CU V\k u ) <0(\CU V|) <0(m) 

vcecuv J 

time to generate the sets associated to chance and value nodes, and 

^Jl^l 1 ^ 1 ) < 0(\V\k u k k ") < 0(\V\) <0{m) 





time to generate the sets associated to decision nodes. 

Let us consider the propagation step. First, we need to find all sets contain- 
ing a variable Xj. Since each set has a scope with at most lj variables, and there 
are 0(m) sets in Vi-i (the i — 1 sets generated in the previous iterations 

plus the 0(\ch Xt U {X t }\) < 0{m) sets from V \ \J]=\ the set &'i can be 
obtained in 0(muj) < O(m) time. 
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To compute we first have to compute the set of marginals (©# e g/ ^) Xi 
and then obtain the set of maximal valuations. To compute the former we need 
to perform |£>,-| — 1 operations G Q (<I>i €3 $2) and a marginalization, where $1 
and $2 are sets either in Vq or equal to some ^j, with j < i. Each set in 
B[ n Vq contains 0(k u ) < 0(1) elements. 

We will obtain a bound for the number of elements in some 

*;.=max([G Q ([$])]-^), 

where $ = \f r . Let t denote the smallest positive number in the prob- 

ability or in the utility part of a valuation in VE^ , and b denote the number of 
bits required to encode C Since the input probabilities and utilities are ratio- 
nal numbers, each positive input number is not smaller than 2~ b (otherwise we 
would need more than b bits to encode it). The valuations in ^ can be obtained 
by a sequence of marginalizations and combinations of valuations, where each 
valuation is in some $ g C Vo- Hence, t is obtained by a series of multiplica- 
tions and additions and it is therefore a polynomial on the input numbers (the 
probabilities and utilities associated to chance and value variables in C). For 
each variable X eCUV there are 0(k u ) input numbers. Therefore t is a poly- 
nomial of degree 0{mn u} ) < O(m). Since the inputs of the polynomial are either 
zero or some number greater than or equal to 2~ & , it follows that t > 2~ boi - m \ 
Let x denote the scope of 4^-. Since 4^ = max([G Q ,([$)] _Xj ), it follows from 
Lemma |2"?1 that * contains 0((1 - [\og a t\) 2 \ n ^) < 0([bm/ ln(a)] 2 ") elements. 
Since a = 1 + e/2m, we have from Lemma l3"51 fin the appendix) that 



O 



bm 



]n(a) 



< O 



bm 



1 + e/2m 
e/2m 



Hence, the number of valuations in any "J^- is 0([bm 2 /e] 2w ), which is polynomial 
in b, m and 1/e. 

The set ((J)^ g ^ i|f)~ x i can thus be obtained by 




< o 



combinations and 0([bm 2 /e] 2uJ ) marginalizations. The set of maximal valuations 
can be obtained by pairwise comparison of all valuations, and thus also takes 
polynomial time. Finally, computing a set V[ takes time proportional to the 



number of sets in V' i _ 1 and B^, which is a polynomial on m. 



□ 



5 Experiments 

We evaluate the performance of the algorithms on random LIMIDs generated 
in the following way. Each LIMID is parameterized by the number of decision 
nodes d, the number of chance nodes c, the maximum cardinality of the domain 
of a chance variable family up, and the maximum cardinality of the domain of 
a decision variable family lo£>. We set the number of value nodes v to be d + 2. 
For each variable Xi, i = 1, . . . , c+ d + v, we sample flxi to contain from 2 to 4 
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states. Then we repeatedly add an arc from a decision node with no children to 
a value node with no parents (so that each decision node has at least one value 
node as children). This step guarantees that all decisions are relevant for the 
computation of the MEU. Finally, we repeatedly add an arc that neither makes 
the domain of a variable greater than the given bounds nor makes the treewidth 
more than 10, until no arcs can be added without exceeding the bounds^ Note 
that this generates diagrams where decision and chance variables have at most 
log 2 u>d — 1 and log 2 wc ~~ 1 parents, respectively. Once the DAG is obtained, we 
randomly sample the probability mass functions and utility functions associated 
to chance and value variables, respectively. 

We compare LVE against the CR algorithm of de Campos and Ji Q in 
1620 LIMIDs randomly generated by the described procedure with parameters 
5 < d < 50, 8 < c < 50, 8 < cu D < 64 and 16 < cvn < 64. LVE was implemented 
in C++ and tested in the same computer as CR,ll2l Table [T] contrasts the running 
times of each algorithm (averages ± standard deviation) for different configu- 
rations of randomly generated LIMIDs. Each row contains the percentage of 
solved diagrams (Scr and Slve) and time performance (Xcr and Tlve) of each 
of the algorithms for N diagrams randomly generated using parameters d, c, v, 
uid, and uic- For each fixed parameter configuration, LVE outperforms CR by 
orders of magnitude. Also, CR was unable to solve most of the diagrams with 
more than 50 variables, whereas LVE could solve diagrams containing up to 150 
variables and with wd < 32. Both algorithms failed to solve diagrams ojd = 64. 
A diagram is consider unsolved by an algorithm if the algorithm was not able to 
reach the exact solution within the limit of 12 hours. All in all, LVE appears to 
scale well on the number of nodes (i.e., on d, c and v) but poorly on the domain 
cardinality of the family of decision variables (i.e., on lod)- 

A good succinct measure of the hardness of solving a LIMID is the total 
number of strategies |A|, which represents the size of the search space in a 
brute- force approach. |A| can also be loosely interpreted as the total number 
of alternatives (over all decision variables) in the problem instance. Figure [5] 
depicts running time against number of strategies in a log-log scale for the two 
algorithms on the same test set of random diagrams. For each algorithm, only 
solved instances are shown, which covers approximately 96% of the cases for 
LVE, and 68% for CR. We note that LVE solved all cases that CR solved (but 
not the opposite). Again, we see that LVE is orders of magnitude faster than 
CR. Within the limit of 12 hours, LVE was able to compute diagrams containing 
up to 10 64 strategies, whereas CR solved diagrams with at most 10 25 strategies. 

The reduction in complexity obtained by the removal of non-maximal valu- 
ations during the propagation step can be checked in Figure [71 which shows the 
maximum cardinality of a set "J/^ generated in the propagation step in contrast 
to the number of strategies. For each diagram (a point in the figure) solved by 
LVE the cardinality of the sets remains bounded above by 10 6 while we vary the 
number of strategies (which equals the largest cardinality of a propagated set in 
the worst case where no valuation is discarded). This shows that the worst-case 

9 Since current algorithms for checking whether the treewidth of a graph exceeds a fixed 
k are too slow for k > 5 pj, we resort to a greedy heuristic that resulted in diagrams whose 
actual treewidth ranged from 5 to 10. 

10 We used the CR implementation available at http: //www. idsia. ch/~ cassio/id2mip/ and 
CPLEX [l7ll as mixed integer programming solver. Our LVE implementation and the test cases 
are available at http://www.idsia.ch/-cassio/lve/ 
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Id 


AT 
1 V 


(X 


c 


V 




U!c 


JGR I/O J 


J-CK ($>) 


<JLVE I/O J 


TV, m fa\ 

^LVE (sj 


1 


60 


5 


8 


7 


12 


16 


100 


6 ± 45 


100 


0.006 ± 0.01 


2 


60 


5 


8 


7 


16 


16 


100 


9 ± 43 


100 


0.02 ± 0.05 


3 


60 


5 


8 


7 


8 


16 


100 


6 ± 51 


100 


0.002 ± 0.01 


4 


60 


10 


8 


12 


12 


16 


98 


15 ± 53 


100 


0.02 ± 0.02 


5 


60 


10 


8 


12 


16 


16 


93 


107 ± 273 


100 


103 ± 786 


6 


60 


10 


8 


12 


8 


16 


100 


0.4 ± 0.2 


100 


0.007 ± 0.01 


7 


60 


10 


28 


12 


12 


16 


96 


1175 ± 6126 


100 


0.05 ± 0.08 


8 


60 


10 


28 


12 


16 


16 


83 


3340 ± 8966 


100 


0.2 ± 0.2 


9 


30 


10 


28 


12 


16 


64 


10 


2838 ± 1493 


96 


47 ± 142 


10 


30 


10 


28 


12 


32 


16 


93 


1070 ± 2461 


100 


0.2 ± 0.4 


11 


60 


10 


28 


12 


32 


32 





— 


93 


905 ± 2847 


12 


30 


10 


28 


12 


32 


64 


3 


73 ± 


86 


2440 ± 7606 


13 


30 


10 


28 


12 


64 


64 





— 







U 


60 


10 


28 


12 


8 


16 


100 


1 ± 3 


100 


0.01 ± 0.007 


15 


60 


20 


8 


22 


12 


16 


93 


2687 ± 7564 


100 


155 ± 1196 


16 


90 


20 


8 


22 


16 


16 


38 


5443 ± 10070 


98 


270 ± 1822 


17 


30 


20 


8 


22 


16 


64 


30 


9660 ± 10303 


100 


29 ± 84 


18 


60 


20 


8 


22 


32 


32 







78 


938 ± 1417 


19 


30 


20 


8 


22 


32 


64 







76 


1592 ± 3402 


20 


30 


20 


8 


22 


64 


64 












21 


60 


20 


8 


22 


8 


16 


100 


7 ± 20 


100 


0.02 ± 0.008 


22 


60 


10 


78 


12 


16 


16 


60 


5944 ± 9920 


100 


0.5 ± 0.5 


23 


30 


10 


78 


12 


32 


16 


70 


3820 ± 8127 


100 


0.6 ± 1 


24 


60 


20 


58 


22 


12 


16 


50 


6455 ± 9344 


100 


522 ± 4011 


25 


60 


20 


58 


22 


16 


16 


11 


11895 ± 12662 


100 


2 ± 11 


26 


60 


20 


58 


22 


8 


16 


96 


849 ± 4098 


100 


0.07 ± 0.04 


27 


60 


30 


38 


32 


12 


16 


28 


3416 ± 4827 


98 


35 ± 214 


28 


30 


30 


38 


32 


16 


16 







100 


2 ± 10 


29 


60 


30 


38 


32 


8 


16 


96 


2261 ± 6572 


100 


0.1 ± 0.03 


30 


30 


30 


88 


32 


12 


16 







100 


230 ± 1027 


31 


30 


30 


88 


32 


8 


16 


60 


3448 ± 5837 


100 


0.2 ± 0.1 


32 


30 


50 


48 


52 


12 


16 







96 


1753 ± 7405 


33 


30 


50 


48 


52 


8 


16 


10 


5014 ± 2974 


100 


0.5 ± 0.09 



Table 1: Performance of LVE and CR on randomly generated LIMIDs (numbers 
arc rounded down). 
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Number of strategies (|A|) 



Figure 6: Running time of LVE and CR on randomly generated LIMIDs. 



analysis in Section [3. II is excessively pessimistic. 

We also compared the performance of LVE with its FPTAS version e-LVE. 
The results using approximation factors e = 0.1 and e = 0.01 are in Table [21 
The numbers in the first column identify each row by the corresponding row 
in Table Q] The second and the fifth columns describe the percentage 5(e) of 
instances solved by e-LVE within 12 hours for different approximation factors e; 
third and sixth columns report the average and standard deviation of relative 
running time on instances which both e-LVE and LVE were able to solve within 
the time limit, that is, 



AT(e) 




where T(e) denotes the running time of e-LVE (run with approximation factor 
e) on an instance, n(e) denotes the number of cases solved by both e-LVE and 
LVE, and the sums are over these cases. Negative values of AT denote (sets of) 
instances on which e-LVE was faster than LVE. Finally, the fourth and the last 
columns show the relative maximum cardinality of a set in e-LVE with respect 
to LVE: 



AC(e) - W) ^ c LVE 



± 



/ 1 V / C(6)-CL VE y / 1 v C(e)-CW 2 



where C(e) denote the maximum cardinality of a set ^\ produced by e-LVE 
with approximation factor e, and Clve = maxj |^|. As with AT, negative 
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Number of strategies (|A|) 

Figure 7: Maximum number of valuations in a set during the propagation step 
of LVE. 



values of AC indicate cases in which e-LVE produced (on average) sets smaller 
in cardinality than those produced by LVE. From the table, we see that the ap- 
proximation algorithm is slower (on average) than the exact version apart from 
three sets of instances (viz. sets 6, 30, 31, 32) with approximation factor 0.1. We 
credit the inferior performance of e-LVE to the extra complexity introduced by 
the G a operations. Additionally, as Theorem [5] shows, the polynomial running 
time of e-LVE can be obtained only by considering the cardinality of domains 
bounded, which in our experiments signifies a low value of max{cL>£), ^c}- To 
see that the G Q operation indeed remove elements from the sets, note that the 
maximum cardinality of a set ^ produced by e-LVE is smaller (on average) than 
the maximum cardinality of a set produced by LVE on almost all instances 
(i.e., AC is negative). Finally, we note that with e = 0.1, e-LVE was able to 
solve six cases which LVE failed to solved within the time limit, whereas LVE 
solved two cases which e-LVE could not solve. For e = 0.01, the exact version 
was able to solve 14 cases where the approximation failed, and e-LVE solved five 
cases which LVE was not able to solve. All cases solved by e-LVE with e = 0.01 
were also solved with e = 0.1 (but not the converse). 



6 Related Work 

Influence diagrams were introduced by Howard and Matheson [l2| as a con- 
cise language for the specification of utility-based decision problems. There is a 
substantial literature that formalizes influence diagrams and develop algorithms 
under the premises of no forgetting and regularity 0, 2(J 21 1. We point the in- 



terested reader to the works of Jensen and Nielsen [13J and Koller and Friedman 

Zhang et al. [23j studied families of LIMIDs that could be solved by dynamic 
programming, such as LIMIDS respecting no forgetting and regularity. The SPU 
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id 5(e = 0.1) AT(e = 0.1) AC(e = 0.1) 5(6 = 0.01) AT(e = 0.01) AC(e = 0.01) 



1 


100 


0.13 ± 0.43 


-0.15 


± 


0.28 


100 


1.48 ± 2.96 


-0.15 


± 


0.28 


2 


100 


0.27 ± 0.57 


-0.11 


± 


0.24 


100 


3.16 ± 4.70 


-0.11 


± 


0.24 


3 


100 


0.01 ± 0.11 


-0.12 


± 


0.24 


100 


0.39 ± 1.41 


-0.12 


± 


0.24 




100 


0.24 ± 0.49 


-0.18 


± 


0.31 


100 


2 58 ± 3 16 


-0.18 


± 


0.31 


5 


100 


0.59 ± 0.83 


-0.13 


± 


0.31 


100 


6.60 ± 6.47 


-0.13 


± 


0.31 


6 


100 


-0.02 ± 0.10 


-0.18 


± 


0.28 


100 


0.18 ± 0.49 


-0.18 


± 


0.28 


7 


100 


41 ± 56 


-0.12 


± 


0.25 


100 


4.25 ± 4.55 


-0.12 


± 


0.25 


«§ 


100 


1.22 ± 0.90 


0.00 


± 


0.00 


100 


11.92 ± 9.01 


0.00 


± 


0.00 




96 


1.94 ± 1.04 


0.00 


± 


0.00 


96 


19.94 ± 10.64 


0.00 


± 


0.00 


10 


100 


1.08 ± 1.10 


0.00 


± 


0.00 


100 


10.64 ± 10.57 


0.00 


± 


0.00 


11 


93 


1.55 ± 1.13 


-0.02 


± 


0.12 


90 


15.71 ± 9.78 


-0.02 


± 


0.12 


12 


80 


1.54 ± 0.91 


-0.01 


± 


0.05 


73 


15.90 ± 9.16 


-0.01 


± 


0.05 


13 


o 










o 










1A 


100 


07 ± 34 


-0.14 


± 


0.24 


100 


0.81 ± 1.62 


-0.14 


± 


0.24 


15 


100 


0.18 ± 0.35 


-0.23 


± 


0.35 


100 


2.58 ± 2.30 


-0.23 


± 


0.35 


16 


98 


0.91 ± 1.19 


-0.24 


± 


0.40 


98 


10.37 ± 9.86 


-0.24 


± 


0.40 


17 


100 


1.07 ± 1.05 


-0.14 


± 


0.30 


100 


11.64 ± 9.69 


-0.14 


± 


0.30 


18 


78 


1.69 ± 1.36 


-0.02 


± 


0.12 


66 


15.99 ± 13.47 


-0.03 


± 


0.13 


19 


76 


1.45 ± 1.65 


-0.16 


± 


0.31 


73 


15.07 ± 14.62 


-0.17 


± 


0.31 


20 
























21 


100 


0.14 ± 0.33 


-0.20 


± 


0.26 


100 


0.54 ± 1.02 


-0.20 


± 


0.26 


22 


100 


1.32 ± 1.25 


0.00 


± 


0.00 


100 


13.02 ± 12.37 


0.00 


± 


0.00 


23 


100 


0.99 ± 1.12 


0.00 


± 


0.00 


100 


9.83 ± 10.40 


0.00 


± 


0.00 


24 


100 


0.25 ± 0.54 


-0.25 


± 


0.36 


100 


3.72 ± 3.88 


-0.25 


± 


0.36 


25 


100 


1.74 ± 1.31 


0.00 


± 


0.00 


100 


17.38 ± 12.94 


0.00 


± 


0.00 


26 


100 


0.02 ± 0.17 


-0.20 


± 


0.31 


100 


0.44 ± 0.62 


-0.20 


± 


0.31 


27 


98 


0.10 ± 0.37 


-0.29 


± 


0.40 


98 


2.28 ± 1.90 


-0.29 


± 


0.40 


28 


100 


1.67 ± 1.18 


0.00 


± 


0.00 


100 


16.93 ± 12.03 


0.00 


± 


0.00 


29 


100 


0.03 ± 0.08 


-0.13 


± 


0.26 


100 


0.31 ± 0.30 


-0.13 


± 


0.26 


30 


100 


-0.06 ± 0.51 


-0.62 


± 


0.41 


100 


2.63 ± 2.80 


-0.62 


± 


0.41 


31 


100 


-0.03 ± 0.10 


-0.44 


± 


0.40 


100 


0.24 ± 0.30 


-0.44 


± 


0.40 


32 


96 


-0.22 ± 0.41 


-0.82 


± 


0.25 


96 


0.91 ± 1.59 


-0.82 


± 


0.25 


33 


100 


0.01 ± 0.02 


-0.38 


± 


0.33 


100 


0.12 ± 0.10 


-0.38 


± 


0.33 



Table 2: Relative (to LVE) performance of e-LVE with e = 0.1 and e = 0.01 
(numbers are truncated). 
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algorithm of Lauritzen and Nilsson [16| solves these cases in polynomial time 
if the diagram has bounded treewidth. To the best of our knowledge, the only 
attempt to (globally) solve arbitrary LIMIDs exactly without recurring to an 
exhaustive search on the space of strategies is the CR algorithm of de Campos 
and Ji [H[ against which we compare our algorithm. 

Shenoy and Shafer [22| introduced the framework of valuation algebras, 
which states the basic algebraic requirements for efficient computation with val- 
uations. More recently, Haenni [11| incorporated partially ordered preferences 
in the algebra to enable approximate computation. Fargier et al. [9j then ex- 
tended the framework with a preference degree structure in order to capture the 
common algebraic structure of optimization problems based on a partial order. 
The algebra we develop in Section [3] can be partly casted in this framework. 

The variable elimination algorithm we develop here is conceptually close 
to the message passing algorithm of Dubus et al. Q- Their algorithm, how- 
ever, does not handle uncertainty and target primarily the obtention of Pareto- 
efficient solutions for a specific class of multi-objective optimization problems. 

There is a close relation between maximum a posteriori (MAP) inference in 
Bayesian networks and LIMIDs whose decision variables have no parents. In 
this sense, the algorithm of de Campos jij, which solves MAP by propagating 
Pareto efficient probability potentials in a join tree, relates to ours. 

7 Conclusion 

Solving limited memory influence diagrams is a very hard task. The complexity 
results presented here show that the problem is NP-complete even for diagrams 
with bounded treewidth and number of states per variable, and that obtaining 
provably good approximations in polynomial time is unlikely if the number of 
states is not small. Remarkably, as we show here, if the cardinalities of the 
variable domains are bounded by a constant, the problem does have a fully 
polynomial time approximation scheme. 

Despite the theoretical hardness of the problem, we developed an algorithm 
that performed empirically well on a large set of randomly generated problems. 
The algorithm efficiency is based on the early removal of suboptimal solutions, 
which helps the algorithm to drastically reduce the search space. In the worst 
case, the algorithm runs in time exponential in both the width of the elimination 
ordering and the cardinality of decision variables. 

In the experiments we conducted, the approximation did not result in a 
speed up of running time compared to the exact algorithm. This might be 
caused by the large constants produced by the boundedness assumptions, but 
might also be due to the ability of the exact algorithm in discarding many in- 
termediate solutions. We note, however, that the simple existence of an efficient 
approximation shows that faster algorithms might exist, for instance, by allow- 
ing additive instead of multiplicative errors, or by coupling these ideas into a 
more sophisticated framework of propagation of functions. 

Designing good heuristics for elimination orderings seems to be more com- 
plex than with standard variable elimination algorithms (e.g., for belief updating 
in Bayesian networks), because there is a second component, the cardinality of 
a set, that together with domain cardinalities we wish to minimize. In fact, 
some preliminary experimentation has shown that favoring set cardinality at 
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expense of domain cardinality might be a good approach. Unlike standard vari- 
able elimination, given an elimination ordering and a LIMID, it does not seem 
to be possible to determine the true complexity of LVE in advance (i.e., prior 
to running the algorithm). 



Appendix A. Missing Proofs and Additional Re- 
sults 

This section contains long proofs that were left out of the main part to improve 
readability, and less central results used in some of the proofs. Some of the 
results in here are based on results obtained elsewhere and reproduced here 
(albeit with minor modifications) for completeness and ease of reading, but 
most are contributions of this paper. Results that are largely based on previous 
results contain a mention to the source; otherwise it is a new result. 

The following two lemmas are used in the proof of Theorem [4] later on. 

Lemma 32. If a > —2 is a real number and i is a nonnegative integer then 
2 « _j_ 2~(*+3) < 2 Q+2 ~\ 

Proof. A similar result was shown by de Campos 0, Lemma 15]. Since 2" > 2~ 2 , 
we have that 2 Q + 2"( l+3 ) = 2" + 2~ 2 • 2^* < 2 Q (1 + 2-*- 1 ), and it is sufficient 
to show that 1 + 2~ 4 ~ 1 < 2 2 . From the Binomial Theorem we have that 



(1 + 2-r =E( 2 J)(2- 



i-l\k 



k=0 

For k = 0, . . . , 2 l , we have that 

'2*\ 2*(2 l -l)---(2 ,; 



Hence, 



k k\ 



2* 2* 



<(T) 



i\k 



(1 + 2-' 1 - 1 ) 2 ' < ^{2 t ) k {2- l - 1 ) k = 2 ~ k < T 



,-fc 

k=o k=a k=a 



and therefore 1 + 2" 1 " 1 < 2 2 '. □ 
Lemma 33. If < x < 1/2 then 2 X - 1 + 2-' x - 1 > 2 X " . 

Proof. We obtain the result by approximating the functions on the left- and 
right-hand side of the inequalities by their truncated Taylor expansions f(x) and 
g(x), respectively, and then showing that 2 X ~ 1 + 2~ x ~ 1 > f(x) > g(x) > 2 X . 
The n-th order Taylor expansion of the left-hand side around zero is given by 
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Clearly, the series converges and hence 2 X ~ 1 + 2~ x ~ 1 = lim„_ i . 00 T n (x). More- 
over, for any n, the residual R n {x) — 2 X ~ 1 + 2~ x ~ 1 — T n (x) is positive because 
the terms of the sum are all non negative. Thus, 

f(x) = T 2 (x) = 1 + < 2*-i + 2- x - 1 . 

In a similar fashion, we apply the variable change y — x A on the right-hand 
side and obtain its Taylor expansion around zero, given by 

fc=l 



K2)]* « 



M 
fe=i 



-:r 



which also converges and has positive residual. Hence, 
2 xi = lim T^(x) 



ifc— l 



1 + x 4 ln(2) + x 2 ln(2) ( ]T , 1 ; 2 



\k=2 



< l + x 4 ln(2)+x' 



ln ( 2 ) (£ sir) 



= l + x 4 ln(2) + M^ 2 = ff (*). 

The inequality is obtained by noticing that [ln(2)] fc -7 fc! < 1/2, x < 1/2 < ln(2) 
and that the geometric series 

2^ 2 4fe -! ~ 2 7 ^ V 2 4 J < 2 7 ^-^ I 2 J ~ 2 6 < 32 ' 

fc=2 fc=0 V 7 fc=0 V 7 

Finally, since x 2 < 1/4 < 151n(2)/32 we have that 

9 (*) = l + x 2 l„(2) (* 2 + ^) 

< 1 + ^ 1 , 1 ,2,(| h(2) + !f)) 
_ 1 + 25021!,. _ /W . 

Hence, 2 x4 < g(x) < f(x) < 2*- 1 + 2-*- 1 and the result holds. □ 

Proof of Theorem^ Given a strategy s, deciding whether -E s [£] > k can be 
done in polynomial time according to Proposition [3] 

Hardness is shown using a reduction from the partition problem, which is 
NP-complete (Iol | and can be stated as follows. Given a set of n positive integers 
ai, . . . , a n , is there a set X C A — {1, . . . , n} such that X^igi a i ~ 12ie.A\x ai ^ 
We assume that n > 3. 
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Let a = \^2i i( z j \0ii. An even partition is a subset I C A that achieves 
Siei ai = a " s °l ve partition, we consider the rescaled problem (dividing 
every element by a), so that u$ — - aija < 2 are the elements and we look for a 
partition such that Yliei Vi = 1 (because J2ieA Vi = 

Consider the following LIMID with topology as in Figure [TJ There are n 
binary decision nodes labeled £>!,..., D n . Each decision can take on states 
di and c?2- The chain of chance nodes has n+1 ternary variables Xo, Xx, . . . , X n 
with states x, y, and 2:. There is an arc from X n to the single value node R. 
For notational purposes, we specify a function / over the domain {x,y,z} as 
a triple (f(x),f(y),f(z)). The value node has an associated utility function 
U R = (0, 0, 1). For i — 1, ... ,n, each chance node Xi has an associated set of 
conditional probability mass functions given by 

p% m = (t i ,0,l-ti), p%~ = (1.0,0), 

= (0,1,0), pJ' w = (Q,ti,i-*i), 

p%* = (0,0,1), P *;* = (0,0,1), 

for <j G [0, 1] (we specify these variables later on). Note that Px %X%1 ( w ) = for 
every w G £^fa x . such that to-^** ^ w^^ 1 and u;-l- Xi ^ 2;. Finally, we define 
PXo =(l/3,l/3,'l/3). 

Given a strategy s = (<Jd 15 . . . , #£>„), let I = {i : = di} be the index set 
of policies in s such that (A) = di. We have that 

CUT> \ »=1 / 

x„ \cui?\{js:„} i=i / 



Let 



and 



A TT D.X,-! 



CUP\{X„} i=l CU25\{X„} 

For tu G ^cup such that = a; (i.e., for u; G x^ CuV ) it follows that 

p x " (tt)^ fax ™ ) 7^ if and only if ■uj-I--'''™- 1 = x. But for io^™- 1 = x we have 
that j,^™- 1 ^™- 2 (iA>"'' fax "~ 1 ) ^ if and only if w^ x ™- 2 — x and so recursively. 

Also, for any i G {1, . . . , n}, Px' 11 (w^ iax i ) equals ti if i G X and 1 otherwise. 
Hence, 

_ J I ILez *i> if w ^ Xi = x for i = 1,. . . ,n- 1 



p s {w) = 

and 



0, otherwise, 



1 - 

Px n {x)= P»W=oIl t! 



3 

mex tcuT> i G x 
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Likewise, it holds for w £ y1~ CuX> that 

Ps(w) = 
and therefore 




if = y for i = 1, . . . , n — 1 

otherwise, 



1 



3 

Since y>x„ is a probability mass function on X n , px n (z) = 1 — px n (x) 
Px n (y), and 



= 1 - Px„ (a;) - Px n (y) 

= i -lu t ^-l n *i- 

Let us assume initially that ii = 2 _ " Ui . The reduction from the original 
problem in this way is not polynomial, and we will use it only as an upper 
bound for the outcome of the reduction we obtain later. It is not difficult to 
see that E s [£] is a concave function of V\ , . . . , v n that achieves its maximum at 
J2iex Vi = ^2ieA\x Vi = 1- Since each strategy s defines a partition of A and 
vice-versa, there is an even partition if and only if MEU[£] = 1 — 1/3(1/2+1/2) = 
2/3. 

We will now show a reduction that encodes the numbers ti in time and space 
polynomial in 6, the number of bits used to encode the original problem. In fact, 
this is in close analogy with the final part of the proof of Theorem 10 in 

By setting t t to represent 2~ Vi with 66 + 3 bits of precision (rounding up 
if necessary), that is, by choosing ti so that 2~ Vi < ij < 2~ Vi + q, where 
< Si < 2-( 6b+3 \ we have that 2~ Vi <U < 2- v > +2-( 6b+3 \ which by LemmaEU 
(with a = -Vi > -2 and i = 66) implies 2~ v< < U < 2- v ' +2 ~ 6 " . 

Assume that an even partition X exists. TherT^l 

]JU < 2 2 ~ 6bn -^iv> = 2 -i+2- 66 n < 2-1+2- 56 f 
iei 

U U< 2 2 ~ 6bn -^*\^ = 2- 1+2 ~ Bbn < 2 - 1+2 ~ 5b , 
ieA\z 



and 



MEU[£] > 1 - ~ (2- 1+2_56 + 2- 1+2_5b ) = 1 - (6) 



Let r be equal to 2 2 encoded with 56 + 3 bits of precision (and rounded up) , 
that is, 2 2 ~"° b < r < 2 2 ~ 5 " + 2" (5b + 3 ), which by Lemma [32] (with a = 2~ 5h > -2 
and i = 56) implies 

n — 5b q — 5 b I o — 5b nl- 5b i-i — 4b 

2 2 < r < 2 2 + 2 = 2 2 < 2 2 . (7) 



11 Since the number of bits used to encode the partition problem must be greater than or 
equal to n, we have that n/2 6 < n/b < 1, and hence 2~^+ 1 'i b n < 2~3 b , for any j > 0. 
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The reduction is done by verifying whether MEU[£] > 1 — r/3. We already know 
that an even partition has an associated strategy which obtains an expected 
utility greater than 1 — r/3, because of Equality ^ and the fact that r is 
rounded up. Let us consider the case where an even partition does not exist. 
We want to show that in this case MEU[£] < 1 — 2 2 /3, which by Inequality ([7]) 
implies MEU[£] < 1 — r/3. Since there is not an even partition, any strategy 
induces a partition such that, for some integer —a < c < a different from zero, 
we have that 5Z ig j o>i = a — c and Sie^4\i a-i = a + c, because the original 
numbers ai are positive integers that add up to 2a. It follows that 

n* i+ n u =2 c i°-- 1 +2- c i°-- 1 . 

iez ieA\x 

The right-hand side of the equality is a function on c £ {—a, . . . , a} \ {0}, which 
is symmetric with respect to the y-axis (i.e., /(c) = /(— c)) and monotonically 
increasing for c > 0. Therefore, it obtains its minimum at c = 1. Hence, 

Y[U+ Yl U>2 l ' a - 1 + 2- l ' a - 1 . 

Since n > 3 implies a > 2 (because the numbers ai are positive integers), we 
have by Lemma [3"31 that 

c^X/a— 1 _j_ 2~ X/a— 1 c^l/a 4 * 

Each number et^ is encoded with at least log 2 bits, and therefore b > 
log 2 (ai) + • • • + log 2 (a„) = log 2 (ai • • • a n ). The latter is greater than or equal 
to log 2 (&i + • • • + a n ), and hence is also greater than log 2 a. Thus, we have that 
a < 2 b , which implies a 4 < 2 4b and therefore 1/a 4 > 2~ 4b and 2 1 ' ai > 2 2 ~ 4 " . 
Hence, 

2 i/o-i + 2 -i/o-i > 2 2 ~ 



Thus, if an even partition does not exist we have that 

1 / \ 2 2 ~ 4 " 

MEU[£] = 1 - 3 I II**+ II «i < 1 - — < 1 - r/3. 

To summarize, we have built a LIMID C in polynomial time since each ti 
was specified using 0(b) bits and there are n functions Px %X%1 1 each encoding 
18 numbers (which are either 1, or tj), and 2n + 2 variables with bounded 
number of states. We have shown that there is a one-to-one correspondence 
between partitions of A in the original problem and strategies of C, and that 
for a given rational r = f(b) encoded with 0(b) bits the existence of an even 
partition is equivalent to MEU[£] > 1 — r/3. □ 

The following lemma is used in the proof of Theorem [6j 

Lemma 34. For any x > 1 it follows that x + 1/2 > l/ln(l + 1/a;). 

Proo/. Adapted from Lemma 9 of 0. Let /(x) = ln(l + 1/a;) - l/(ac + 1/2). 
Then 

f'( x ) = I 

3 y ' x 2 + x x 2 +x + l/A' 
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t t t 




Figure 8: Graph structure of the LIMID used in the proof of Theorem \E\ 



which is strictly negative for x > 1 since x 2 +x < x 2 + x + 1/4. Hence, f(x) is a 
monotonically decreasing function for x > 1. Because lim. E _>. 00 /(a;) = 0, f(x) is 
strictly positive in [l,oo). Thus, the result follows from ln(l+l/x) > l/(x+l/2), 
since x > 1. □ 

Proof of Theorem We will show that for any fixed < 7 < 1 the existence 
of a polynomial time (2^ 7 — l)-approximation algorithm for solving a LIMID 
would imply the existence of a polynomial time algorithm for the CNF-SAT 
problem, which is known to be impossible unless P=NP [Toj . A very similar 
reduction was used by Park and Darwiche [3, Theorem 8] to show an analo- 
gous inapproximability result for maximum a posteriori inference in Bayesian 
networks. Notice that for any < e < 2 e — 1 there is 7 < 1 such that e = 2® 1 — 1, 
hence the existence of an e- approximation algorithm implies the existence of a 
(2 97 — l)-approximation, and it suffices for the desired result to show that the 
latter cannot be true (unless P=NP). 

A clause is a disjunction of literals, each literal being either a boolean variable 
or its negation. We say that a clause is satisfied if, given an assignment of truth 
values to its variables, at least one of the literals evaluates to 1. Thus, we 
can decide if a truth-value assignment satisfies a clause in time linear in the 
number of variables. The CNF-SAT problem is defined as follows. Given a set 
of clauses C\, . . . , C m over (subsets of) boolean variables X\, . . . , X n , is there 
an assignment of truth values to the variables that satisfies all the clauses? 

For a positive integer q that we specify later on, consider the LIMID ob- 
tained as follows (the topology is depicted in Figure [8]). For each boolean 
variable Xj we add q binary decision variables D\ , . . . , D\ and q chance vari- 
ables Si,..., Si with domain {0,1,..., m}. Additionally, there are q clause 
selector variables Sq, . . . ,Sq taking values on {1,2, ...,m}, q binary variables 
B 1 , . . . , B q , and a value node U with B q as parent. As illustrated in Figure |5J 
the LIMID consists of q replicas of a polytree-shaped diagram over variables 
D{, . . . , D 3 n , Sq, . . . , B? , and the probability mass functions for the variables 
B 1 , . . . , B q are chosen so as to make the expected utility equal the product of the 
expected utilities of each replica. In any of the replicas (i.e., for j 6 {1, . . . , q}), 
a variable D\ (i = 1, . . . , n) represents an assignment of truth value for JQ and 
has no parents. The selector variables Sq represent the choice of a clause to pro- 
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cess, that is, Sq = k denotes clause Ck is being "processed", and by summing 
out Sq we process all clauses. Each variable Sf , for i = 1, . . . , n and j = l,...,q, 
has D\ and S\_ x as parents. The variables B^> have S 3 n and, if j > 1, B^ 1 as 
parents. For all j, we assign uniform probabilities to Sq, that is, p s i = 1/m. 

For j = 1, . . . , q, we set the probabilities associated to variables S{ , . . . , so 
that if Ck is the clause selected by Sq then Sf is set to zero if C\ is satisfied 
by Di but not by any of D\, . . . , and S*f = S° i _ 1 otherwise. Formally, for 



x G ft 



we have that 



1, if ar^ = x 4 - 5 *-! = ; 

1, if x iS l = and x^*- 1 = k > 1 and = a;^ 1 satisfies C fe ; 

1, if x^ s * = x^*- 1 = k > 1 and = does not satisfy C fc 

0, otherwise. 



Notice that for Sf the first case never occurs since Sq takes values on {1, ... , m}. 
For any joint state configuration x of Sq, . . . , S^, D{, . . . , D 3 n such that x^ s ° = 
k G {!,..., m} (i.e., clause Ck is being processed) and x^ s ™ — 0, it follows that 



equals 1/m only if for some < i < n clause Ck is satisfied by Xi = x^ Di but 
not by any of X\ = x^ Dl , . . . , = x^ Di - x , variables S{ , . . . , S 1 i _ 1 all assume 

value k (i.e., x^ s ' = ■ ■ ■ = x^^ 1 = k), and x 



is] 



X 



isi 



0. Otherwise, 



it equals 0. Hence, for any (partial) strategy s 3 = (S D i, . . . ,6 D j ) we have for 
x = that 



/ 



\ 



E II/'- "PL* 



(x) 



SAT (»> 
m 



where SAT^) denotes the number of clauses satisfied by the truth- value as- 
signment of X\ , . . . , X n according to s J . Each variable B^ is associated to a 

function p B ™ such that for x G O f a . , 



P B 1 ( x > 



'l, ifa;^ '=x^ _1 andx^i =0: 
1, if a;^ 3 = and a^ s ~ ^ ; 
0, otherwise; 



where for i? 1 we assume x^ B = 1. Hence, we have for any joint state configu- 
ration a; of B\ ... , B«, S£, . . . , S« that 



vJ' =1 



(tf) 



fl, if^ Bl =- 

1, ifa^ Bl =- 
0, otherwise. 



ar 173 ' = 1 and x iS <* = ■ ■ ■ = x iS ™ = 0; 
x lB " = and x ls ™ ^ 0; 
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Finally, we set the utility function u associated to U to return 1 if B q = 1 
and otherwise. In this way I u Il|=i Psi ) ( x ) ec l uais 1 if X ^ B = ■ ■ ■ = 
x lB q _ i an j x iS n = ... = X IS% _ q an j zcr0 otherwise. Thus, for any 
strategy s = (s 1 , . . . , s 9 ), where s- 7 = , . . . , , it follows that 



q n . . 

S 3 B 3 - 1 TT D l s l- 



cuv j=i i=i 

= E U L[4^ 1 E ''>. ll/'s/ I'n 

B 1 ,...,B q i=i S^-.-.S^! i=1 

S n>---> S n D{,...,D 3 n 

q , 

Err s^s^ 1 s 3 

B 1 ....,B q i=l 

If the instance of CNF-SAT problem is satishable then there is an optimum 
strategy s such that SAT{s^) = to for all j, and MEU[£] = 1. On the other 
hand, if the instance is not satishable, we have for all j and strategy s that 
SAT(s j ) < m-1, and hence MEU[£] < {m-l) q /m q . For some given < 7 < 1, 
let q be a positive integer chosen so that l/2 e "' > m q /(m + l) q . We show later 
on that q can be obtained from a polynomial on the input. If the CNF-SAT 
instance is satishable, a (2 0T — l)-approximation algorithm for MEU[£] returns 
a value E s [C] such that 

MEU[£] / m \ 9 (m-1 



r „ n MEULC] f m Y 



m 



where the rightmost strict inequality follows from m/(m + 1) > (m — l)/m. On 
the other hand, if the CNF-SAT instance is not satishable, the approximation 
returns 

E s [£] < MEU[£] < 1 U> ~ ' 

Hence, we can use a (2 s '' - 1) -approximation algorithm to solve CNF-SAT by 
checking whether its output E[£] > (to — l) q /m q . Since q and to are positive 
integers, the test bound (m — l) q /m q can be obtained in polynomnial time. 

It remains to show that the reduction is polynomial in the input. The LIMID 
contains q{2n + 2) + 1 variables, each requiring the specihcation of at most 
2(to + l) 2 numbers in {0, 1/m, 1}. So 0, the number of numerical parameters in 
C, is polynomially bounded by q(m + l) 2 (4n + 4) + 2. Therefore, it suffices to 
show that q is a polynomial on to and n. By dchnition, q obeys 



(l + J_V > 2 [9(™+l) 2 (4«+4)+2P 
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which is equivalent to 



gin (1 H ) > q 1 Urn + if (An + 4) + 2] 7 In 2 

\ TO/ 

x _ 7 [(TO + l) 2 (4n + 4) + 2r 

" 9 > — mi+s — ln2 

^ g >f^ + 1 ^ + 1 t ) + 2]7 ln2^ 

Since by Lemma l34l to + 1/2 > l/ln(l + 1/to) and 2 > ln(2), it suffices to 
choose q such that 

q > ((2to+ 1)[(to+ l) 2 (4n + 4) + 2p , ) T ^ f . 

In other words, q is polynomially bounded by to 1 — ? 4u 1 -t. Therefore, if MEU[£] 
can be approximated in polynomial time with an error no greater than 2 7 then 
we can solve CNF-SAT in polynomial time. □ 

We now formally prove the correctness of the LVE algorithm. We start by 
showing that max distributes over set marginalization and set combination: 

Lemma 35. (Distributivity of maximality) . If^ x C Q x and C $j, are two 
finite sets of ordered valuations and z C x, the following holds. 

(i) max(* x ® max(* 9 )) = max^ ® ^ y ); 

(ii) max(max(^I' 2 ;)~'' z ) = max(5'^ z ). 

Proof. Part (i) has been shown by Fargier et al. jjj Lemma l(iv)]. We use a 
similar proof to show that part (ii) also holds. First, we show that max 
max(ma.x(^> x )^ z ). Assume, to show a contradiction, that there is an element 
(f>^. z G max(^;jf ), where <f> x G ty x , which is not an element of max(max(^ x )^ z ) . 
By definition of max(5 , x ), there is ip x G max(^ x ) such that <p x < ip x . Hence, 
(AEJ) implies ^ < ipj. z , and because ip^ z G it follows that 4> x z — ^if> 
and therefore (j)\. z G max(5' x f z . Since <p^. z ^ max(max(5' x )-'- z ) there is </> z G 
max(max(4' a ;)-'' z ) such that <pjf < (j) z . But this contradicts our initial assump- 
tion since (f> z G ^ x z - 

Let us now show that max(5'^ z ) D m&x^&x^ x f z ) . Assume by contradic- 
tion that there is ip z G max(max(5 , 2; )~'' z ) \ max(5'^ z ). Since ip z G ^ x z , there 
is (f> z G max^^ 2 ) such that ip z < <\> z . But we have shown that max^^ 2 ) C 
max(max(4' x )-'- z ), hence ip z = tfi z and ip z G max^jjf), a contradiction. □ 

At any iteration i of the propagation step, the combination of all sets in 
the current pool of sets Vi produces the set of maximal valuations of the initial 
factorization marginalized to Xi+i, . . . , X n : 

Lemma 36. For i G {0, 1, . . . , n}, it follows that 

- -{Xi,...,X 4 }^ 



max 



*eVo 




where for each i, Vi is the collection of sets of valuations generated at the i-th 
iteration of the propagation step of LVE. 
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Proof. By induction on i. The basis (i — 0) follows trivially. 
Assume the result holds at i, that is, 



(g * 

,*ev . 



-{X lt ...,Xt}\ 




By eliminating Xi+x from both sides and then applying the max operation we 
get to 



max 



max 



(g * 

,*ev . 



-{X u ...,X t }\ 



-x i+1 " 




Applying Lemma 1551 ii) to both sides and (A|2J) to the left-hand side yields 



(g * 

,*ev . 



-{X u ...,X i+1 }^ 



max 



max 



max 



-x 44 



(g * 

*eVi\B i+ i 

*eVi\B 4+1 
(g * 

(g) *) , 

*GV i+ i / 



(g * 

*eB i+ i 



(g) * 



-x <+ A 



where the passage from the first to the second identity follows from element- 
wise application of (A[JJ and Lemma [Til the third follows from the second 
by Lemma l33l i) . and the last two follow from the definitions of "J^ and V-i+i, 
respectively. □ 

We are now able to show the correctness of the algorithm in solving LIMIDs 
exactly. 

Proof of Theorem llffl The algorithm returns the utility part of a valuation (p, u) 
in max (&)y eV VP) , which, by Lemma l36l for i = n, equals max ( [(^)^ eVo ^] . 
By definition of Vo, any valuation <fr in (0^, eVo factorizes as in (JSJ). Also, 
there is exactly one valuation <f> £ (®^ 6 y f° r eac ^ strategy in A. Hence, 

by Proposition [T2j the set (<^>^, eVo contains a pair (1,E S [£]) for every 

strategy s inducing a distinct expected utility. Moreover, since functions with 
empty scope correspond to numbers, the relation < specifies a total ordering over 

the valuations in (<^)^ gVo , which implies a single maximal element. Let s* 
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be a strategy associated to (p,u). Since (p,u) G max ( [0^ e Vo ^ , it follows 
from maximality that E s * [C] > E S [C] for all s, and hence u — MEU[£]. □ 

Proof of Proposition dH Consider a variable D in £ and let Z?i , . . . , D m be the 
corresponding decision variables and X\ , . . . , X m the corresponding chance vari- 
ables in C! . Also, let 

{ y i=l J 

where j/ = {Di, X\, . . . , Z? m _i, X m -\, D m }, and, for each Di, p^ denotes the 
probability mass function that assigns all mass to dj G £Id (hence each set Vd, 
contains \^Id\ functions, and the set Vd x X • • • X VD m has ISIdI" 1 elements). It 
suffices for the result to show that V is equal to Vd- The functions p G V have 
scope equal to {X m } Upa^,, and domain f2{x OT }Upa D = ^fa D - Consider p G V, 
and let w = y U {X m } U pa^,. For x G ^f ar> , let 1 < j < m be such that 



x^*n = nj . Thus, 



P (x)= ^ p^'^o^^on^^^^)- 

For all i ^ j, the values (2/^ faXj ) do not depend on the realization of y^ Di 
Hence, 

where z — {D±, . . . , D m }\{Dj}, and the term inside the parentheses is the sum- 
marginal of Yiijtj Pdi over z. Because each p^ is a probability mass function on 
Di this term equals one and we have that 

m 

p(x)= J2 PdAy iDj )I[px-'(y^)- 

1/Ga;t™\ z i=l 

Now, for all i ^ j, the values p^ Xi {y^**** ) equal one if y^ x * = y-P^-i and zero 
otherwise. In addition, the values p%* 3 (y^ laXj ) equal one if y^ x i = yi- x J-i = 
yWj anc j zer0 otherwise. Hence, the product YiiLi Px*' (y^ faXi ) differs from 
zero only if y^ Xl = ■ • • = yl~*m = y4-Dj ) m which case it equals one. Since 
Pdj (y'" Dj ) equals one if y^ D i = dj and zero otherwise, we have that 



P(x) 



1, if x iXm = dj : 
0, otherwise. 



Notice that for each itj G ^ pai3 there is exactly one d G £Id such that p(d, TVj) — 
1, and hence V is the set of degenerate conditional mass functions on VLd- Since 
the set Vd contains a function p p ^ D for every possible combination of degenerate 
mass functions on £Id (one mass function for each iZj G ^ P a D ) , it follows that 
for each p there is p p ^ D such that p = p v ^ D . Thus, V C Vd- 
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Consider a function p p ^ D G Vd and its associated policy Sjj G Ap. For 
i = 1, . . . ,m let be the function from assigning all mass to di — 5(7Ti). 
Also, let p be a function in V such that SmITJLiPx; * Pck- Then, for d G il_D 
and 7Ti G ^pa D , Pd d {d, 7Tj) = Pdi(d) = p(d,TVi). Hence, for any function 
there is p G V such that = p, and 'Pd CP. □ 

The following inequalities are required for the proof of Theorem 1311 

Lemma 37. For any nonnegative integer k and < x < 1, 

/ x \ k 
l + 2,>(l + -) . 

Proof. From the Binomial Theorem of Elementary Algebra we have that 



because 



/ X\ K ■s-^/K\X X X 

\ 1+ k) = ^\i)¥-^~n-^H 



i=0 v ' i=0 i=0 



k\ _ k(k-l)---(k-i + l) < k l 
i ) il ~ il 



for i = 0, . . . , k. Thus, it suffices to show that e x < 1 + 2x, which is true if and 
only if x < ln(l + 2x). Let f(x) = ln(l + 2x) - x. Then 

{>0, if x < 1/2, 
= 0, if, = 1/2, 
<0, if x > 1/2, 

and therefore f(x) monotonically increases from to 1/2 and monotonically 
decreases from 1/2 to 1. Since /(0) = and /(l) — ln(3) — 1 > 0, it follows 
that / > in [0, 1] and hence x < ln(l + 2x) for x G [0, 1]. □ 

Lemma 38. For any x > 0, 

ln(l + x)> 



1 + x 

Proof. Let f(x) = ln(l + x) — x/(l + x). Then / is a monotonically increasing 
function for x > because 

fix) 



(l + xf ' 

Since /(0) = 0, ln(l + x) - x/(l + x) > for all x > and the result follows. □ 
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